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Preface 


Polynomials have been around for a long time. They play a prominent role 
in many applications and are basic objects in commutative algebra and algebraic 
geometry. In the 20th century, the foundations of algebraic geometry required a 
level of abstraction that made these tools hard to use for many people interested in 
applications. But with the advent of powerful algorithms, personal computers, and 
the theory of sparse polynomials, we now have computational commutative algebra 
and computational algebraic geometry. Coupled with accessible introductory texts, 
recent years have witnessed a remarkable range of applications. 

The goal of this volume is to introduce some of these applications. The five 
chapters are based on the NSF-CBMS Regional Research Conference Applications 
of Polynomial Systems held at Texas Christian University (TCU), June 4-8, 2018. 
The format of the conference, replicated in this book, is that each day David Cox 
gave two lectures on a topic, followed by a third lecture by an expert on the topic. 

The book is not a complete introduction to the topics covered. Many proofs 
are sketched or omitted, and the emphasis is on the examples. The hope is that 
the brief presentation provided here inspires you to learn more. 

The first two chapters set the background for the rest of the book. Chapter 1 
focuses on the rich history of elimination theory from the work of Newton and 
Bézout in the 17th and 18th centuries through the decline of elimination theory in 
the middle of the 20th century, followed by the emergence of symbolic computation 
in the latter part of the century. In the final section of the chapter, Carlos D’Andrea 
brings elimination theory into the 21st century. 

Chapter 2 begins with the tension between the perfect information needed 
for symbolic computation and the messiness of the information coming from the 
real world. The chapter explores linear algebra and homotopy continuation, two 
commonly used methods for solving systems of polynomial equations numerically. 
This is where we meet our first substantial applications. Jonathan Hauenstein 
concludes the chapter with a discussion of sampling in numerical algebraic geometry. 

The remaining chapters of the book focus on three substantial applications. 
Chapter 3 explores the geometry and algebra of geometric modeling, with some 
unexpected connections to toric varieties, algebraic statistics, and Rees algebras. 
At the end of the chapter, Hal Schenck surveys a variety of algebraic tools which 
are used in geometric modeling. 

Chapter 4 is devoted to the geometry and combinatorics of rigidity theory. 
For bar-and-joint frameworks, basic objects include graphs and rigidity matrices, 
though polytopes and matroids also have an important role to play. A fun result 
due to Gross and Sullivant uses the rigidity matroid to study maximum likelihood 
estimates of Gaussian graphical models coming from planar graphs. Jessica Sidman 
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ends the chapter with a discussion of body-and-bar and body-and-cad frameworks 
and the study of non-generic frameworks via the Grassmann-Cayley algebra. 

Chapter 5 shifts the scene to chemical reaction networks, where the Law of 
Mass Action leads to a lovely combination of graph theory and dynamical systems. 
This framework includes examples from chemistry, biology, epidemiology, and pop- 
ulation genetics. Algebraic geometry enters the picture because the steady states 
are defined by a polynomial system. Toric varieties also have a role to play by the 
pioneering work of Karin Gatermann. Alicia Dickenstein finishes the chapter with 
an exposition of the interesting things that can happen in biochemical reaction 
networks. 

Each chapter starts at a relatively elementary level, with more advanced topics 
introduced as needed. The algebraic geometry background can be found in the book 
[104]. In addition to the main applications presented in Chapters 3, 4, and 5, the 
twin themes of toric varieties and algebraic statistics play a prominent role in the 
book. Readers unfamiliar with this material should consult the expository papers 
and [214]. 

We hope you enjoy the book! We had fun writing it. 
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CHAPTER 1 


Elimination Theory 


Elimination theory has important roles to play in both algebraic geometry 
and symbolic computation. The first two sections of the chapter take a historical 
approach so that you can see how elimination theory developed in the last 300 
years. In the third section Carlos D’Andrea will survey recent developments. 


1.1. Elimination Theory in the 18th and 19th Centuries 


Algebra was well-established in Europe by the end of the 17th century. The 
full story of how this happened and what followed in the subsequent two centuries 
is far beyond the scope of this volume. Readers interested in the details should 
consult (for algebra in general) and (for elimination theory). We will 
instead focus on some vignettes that illustrate the evolution of elimination theory. 


Newton and Tschirnhaus. In spite of the title of the section, we begin in the 
17th century with extracts from Newton and Tschirnhaus. In a manuscript written 
sometime between 1666 and 1671, Newton writes: 

Datis duabus curvis invenire puncta intersectionis. this is rather a principle then 

a probleme. But rather propounded of y° Algebraicall then geometricall solutions 

& y' is done by eliminating one of the two unknown quantitys out of y° equations. 

From whence it will appeare y* there are soe many cut points as the rectangle of 

the curves dimensions. p. 177] 
The Latin says “Given two curves, to find their points of intersection.” The English 
is archaic, but once you realize that “y® = the” and “yt = that”, Newton’s meaning 
is clear: 


e He solves the geometric problem by working algebraically. 

e The algebra is “done by eliminating one of the two unknown quantitys out 
of y° equations.” 

e The degree of the resulting equation is the product of the degrees of the 
curves (“the rectangle of the curves dimensions” ). 


This is a clear statement of Bézout’s Theorem, along with the strategy of reducing 
to a single variable that was to dominate the early history of elimination theory. 
Here is an application of Newton’s “principle.” 


EXAMPLE 1.1. Newton counts the number of tangent lines to a curve that go 
through a given point: 

a line drawn from a given point may touch a curve of 2 dimensions in 1 x 2 points, 

of 3 in 2 x 3 points, of 4 in 3 x 4 points, &c. p. 179] 
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We explain this as follows. Let U(x, y,z) = 0 define a smooth curve of degree of n 
in the projective plane P?. If the given point is P = (a, 6,7), then any solution of 


U=0 
OU OU OU 
“Ox oP oa To, =e 


is a point on the curve whose tangent line contains P. These curves have degrees 
n and n—1, giving n(n — 1) tangent lines by Bézout’s Theorem. This explains the 
products 1 x 2, 2 x 3 and 3 x 4 in the quote. << 


Another example of 17th century elimination can be found in the work of 
Tschirnhaus. In 1683, he published a paper on roots of a polynomial equation [348]. 
Given y? — qy—r = 0, Tschirnhaus introduces a new variable z via the T'schirnhaus 
transformation y? = by +z+a, where a and b are unknown coefficients. He writes: 

following from this let there be a third equation (by proceeding according to the 

recognized rules of analysis) in which the quantity y is absent, and we shall obtain 


2° + (3a — 2q)z” + (3a? — 4qa + q? — qb? + 3rb)z +--- =0. 


This leads to a solution of y3 — gy — r = 0 as follows: 


Pick a to make the coefficient of z? vanish. 

Use the quadratic formula to pick 6 to make the coefficient of z vanish. 
The equation reduces to z? = constant, so z is a cube root. 

e Solving y? = by + z+ for y gives a root of our original cubic. 


Tschirnhaus’s approach is based on elimination (“in which the quantity y is absent” ) 
using the “recognized rules of analysis.” 

These examples show that elimination theory was an important part of algebra 
in the 17th century. 


Cramer and Bézout. By the middle of the 18th century, Bézout’s Theorem was 
well-known. Here is Cramer writing in 1750: 
If one has two variables and two indeterminate equations ... of which one is of 
order m and the other of order n, when, by means of these equations, one expels 


one of these variables, the one that remains has, in the final equation ... at most 
mn dimensions. 


In his 1764 paper Sur le degré des équations résultantes de l’évanouissement des 
inconnues [27], Bézout uses the terms “final equation” and “resultant equation” for 
the result of the elimination. The first main result of is a version of Bézout’s 
Theorem that draws on earlier work of Euler. 


EXAMPLE 1.2. Let f(a,y) and g(x,y) have Newton polytopes: 
m! +p’ 


(m, p) m’,p’) 


Thus f has total degree m+ p in x,y, but its degree in x is bounded by m, with a 
similar story for g. 

To eliminate x from f(x,y) = g(a, y) = 0, Bézout multiplies f, g by polynomials 
G,H of degree m’ — 1,m —1 in x with unknown coefficients. Then equating the 
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coefficients of powers of x in the equation 
(1.1) Gf + Hg = h(y) 


gives a linear system of m+ m’ unknowns in m+ m’ variables with a familiar 
coefficent matrix. We illustrate this for m = m’ = 2, where 


f(x,y) =A(y)a? + Bly)a + Cy), g(x,y) = A’(y)a? + B'(y)e + C(y), 
and deg(A(y)) < p, deg(B(y)) < p+ 1, deg(C(y)) < p+ 2, with similar bounds for 
the degrees of A’(y), B'(y), C’(y). If we set 

G=Mzx+N, H=M'r+N’ 


for unknown coefficients M, N, M’, N’, then equating coefficients of x in (LI) gives 
the system of equations 


0 

A B ASIN 
BC BY |M 
C 0 CY {|N 
whose coefficient matrix is the Sylvester matrix, over 75 years before Sylvester wrote 
it down in 1840. Bézout goes on to show that when z is eliminated, the “équation 
résultant” in y has degree mm’ + mp!’ + m'p. < 


(1.2) 


SQW. 
i) 


These days, we use the term sparse elimination for Example [L.2] yet Bézout’s 
paper dates from 1764. We will soon see that Bézout had even more exotic versions 
of his theorem. 

We turn to a second example from Bézout’s paper. 


EXAMPLE 1.3. Suppose we have cubic equations 
f=Ae?+ Ba? +Czx+D=0 
g = A'z? + B'n? + C'xn + D' =0. 
Then Bézout proceeds as follows: 
(1) Multiply the first equation by A’, the second by A, and subtract to obtain 
(A'B — AB’)x? + (A'C — AC’)z + A'D — AD’ =0. 
(2) Multiply the first equation by A’x + B’, the second by Ax + B, and subtract 
to obtain 
(A'C — AC")z? + (A’D — AD! + B/C — BC’)z + B'D-— BD! =0. 
(3) Multiply the first equation by A’x? + B’x + C’, the second by Ax? + Bx + C, 
and subtract to obtain 
(A'D — AD’)z? + (B'D — BD')z + C'D — CD’ =0. 


Bézout then considers “each power of x as an unknown,” so that (1)-(3) give a 
system of three equations in three unknowns. The determinant of this system is 
the final equation that eliminates «x. <— 


Example [1.3] and its generalization to deg(f) = deg(g) = m (also studied by 
Bézout) is the origin of what we now call the Bézoutian of f,g. Observe that 


y" -RHS of (1) + y- RHS of (2) + RHS of (3) a f(x)9(y) = P(y)9(@) 


t—y 
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which means that up to a minus sign, the quantities A’B — AB’, A'C — AC’, etc. 
of Bézout are the coefficients b;; in the expression 


3 


(1.3) Heys fk = y ba ty} 


ij=l 


The Bézoutian matrix is Bez(f,g) = (bj;), and its determinant is the resultant 
Res(f,g). We will see later in the section that Cayley discovered this in 1857. 


Bézout and Waring. We now turn to two important books: 


e In 1779, Bézout published Théorie Générale des Equations Algébriques. 
e In 1782, Waring published the second edition of Meditationes Algebraice. 


We will go back and forth between the two books, beginning with a quote from 
Bézout’s Théorie Générale (italics added): 
We conceive of each given equation as being multiplied by a special polynomial. 
Adding up all those products together, the result is what we call the sum-equation. 
This sum-equation will become the final equation through the vanishing of all 
terms affected by the unknowns to eliminate. (224.)| 


Here are the terms in italics, expressed in modern language: 


e We have given equations f, =--- = fy = 0 in x,...,2, (for Bézout, the 
number of equations equals the number of unknowns). 

e Multiply each by special polynomials A,,..., An. 

e Adding then up gives the sum-equation A, fy +---+Anfn = 0. 

e This becomes the final equation when all terms involving the unknowns to 
eliminate vanish, e.g., 


Afi +:+>+Anfn = k{ax4]. 


Thus the elimination ideal (f1,..., fn) A &[a1] is implicit in Bézout. But set theory 
lay over 100 years in the future, so objects like (f1,..., fn) were not part of Bézout’s 
mental landscape—he worked with polynomials finitely many at a time. Even 
writing k[21] is problematic, for Bézout never specified a field. His definition of 
polynomial simply says “coéfficiens quelconques” (“any coefficients” ). 

Now let’s switch to Waring’s Meditationes Algebraica. This book covers an 
impressive range of topics, from symmetric polynomials to the Waring problem 
about expressing positive integers as sums of kth powers. We will focus on Waring’s 
comments about Bézout’s Théorie Générale, which appeared three years before the 
second edition of Meditationes Algebraice [358]. 

Here is a quote from Waring, where we have added bullets and comments 
enclosed in [...] that indicate the relation to Bézout: 

If there are h equations [given equations], of degrees n,m, l,k, etc., respectively, 

in as many unknowns, and 

e if each of these h equations are multiplied by h assumed equations [special 
polynomials] of degrees nmlk---—n, nmlk---—m, nmlk-+-—l, nmlk---—k, 
etc., respectively, then 

e let the resulting equations be added together [swm-equation], and 

e equate to zero the coefficients of each term of degree nmlk--- in the result 
(barring those involving only powers of y), 

whence the equation whose root is x or y or 2, etc. [final equation], cannot be of 

degree greater than nmlk--- [358] p. xxxv, English translation] 
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This is Bézout’s Theorem for n polynomial equations in n unknowns, cast as a 
problem in elimination theory. We will state Bézout’s version below. 

Here is another of Bézout’s results from Théorie Générale, again quoting from 
Waring with bullets added: 


If there are h equations respectively of degrees n,m,l,k,..., 


e all involving the same unknown uence L,Y, 2,v,..., and 
e if pT, 85.6.5 DoT, 8 5. Pd ar" .s",..., etc., are the maximum de- 
grees to which x, y, z,v,... appear in the equations of degrees n,m,1,k,... 
respectively, 
e then the equation whose root is x or y or z, etc., cannot be of degree higher 
than 
nxmxIlxkx---—(n—p) x (m—p’) x (l—p”) x--- 
—(n—q) x (m—q')x (l—q") x + 
—(n—r) x (m=1')x (l-r")x--: 


—--- [358] p. 209, English translation] 


The key feature here is that we fix the total degree of each polynomial and put 
separate bounds on the degree in each variable. This differs from Example [1.2] 
where in 1764 Bézout fixed the total degree and bounded the degree of just the 
first variable. 


Bézout’s Théorie générale. The results quoted from Waring are just the a 
of the iceberg when it comes to the “Bézout theorems” in Théorie Générale [28]. 
This amazing book consists of 469 pages of text organized into sections (1.)— on Vis 

Bézout had an elaborate classification of polynomials and systems of equations. 
He worked affinely with polynomials of degree T in n variables. A polynomial is 
complete when it uses all monomials of total degree < T. We are now ready for 
Bézout’s first theorem: 

The degree of the “équation finale résultant” of any number of complete equations 

containing the same number of unknowns, and of any degree, is equal to the 

product of the exponents of the degrees of these equations. (47.)]. 

To relate this to the number of solutions, recall that the “équation finale résultant” 
comes from eliminating all variables but one, say x1, so its roots are the 2 1- 
coordinates of solutions of the system. Hence, if the number of solutions is finite 
with distinct +; coordinates, then the number of solutions is bounded by the “prod- 
uct of the exponents of the degrees” since some solutions might be at oo (recall that 
we are working affinely). 

For Bézout, a polynomial is incomplete when it is not complete. Incomplete 
polynomials fall into various species. For example, the second result of Bézout 
quoted from Waring is for polynomials of the first species. To indicate how deeply 
Bézout thought about polynomials, let’s explore his third species in detail. 


EXAMPLE 1.4. A polynomial in u, x,y belongs to the third species considered 
by Bézout when it is satisfies the following conditions: 


1.° that wu does not exceed the degree a, x does not exceed the degree a, y does 
not exceed the degree a; 


2.° that u with x does not rise above the dimension b; u with y does not rise 
above the dimension 6; « with y does not rise above the dimension b; 
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3.° that wu with x and with y cannot together rise to a dimension higher than t. 
(82.) 
Then a monomial u"'!a*2y"* is of the third species considered by Bézout if there 
are constants t,a,a, a,b, b, b (in Bézout’s awkard notation) such that 


ee ata ee 
(1.4) O<ky <a, O<kg <a, O<k3 <a 
ky +kg <b, ky t+k3 <b, ko t+k3 <b. 


Bézout notes that the third species has eight forms, according to the signs of 
(1.5) f=b-6+¢, t-b=b+0, t—-0—)+a. 
These days, we regard the inequalities (4) as describing the lattice points in a 


polytope, and the eight forms indicate give eight possibilities for what the polytope 
can look like. Here is a drawing of four of these, taken from |[288}: 


A A 
f 
l I 
——— — 
bo —o 
a7 eer - 
Third species; first form = Third species; third form 


Third species: fifth form Third species; sixth form 


To a reader versed in toric geometry, this picture is astonishing. The polytopes 
share the same 10 inward-facing facet normals. These 10 vectors have a secondary 
fan whose chambers have some wonderful properties: 

e Each polytope in the picture corresponds to a chamber. 


e Two chambers that share a common wall give » < flip 
polytopes that differ by a bistellar flip of an edge: 


WY 
7X 


In the picture, you can see a sequence of flips about edges: 


fli fli fli 
first form <> third form <> fifth form <> sixth form. 
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This corresponds to a path in secondary fan where we traverse four chambers, 

crossing three walls in the process. See $14.4] for more on the secondary fan. 
However, Bézout was not thinking about the splendid geometry implicit in the 

eight forms of the third species. Rather, he wanted the degree of the “équation 

finale résultant” of a system f = g = h = 0 when the polynomials have the same 

form of the third species. In (120.)—(127.)], he states eight Bézout Theorems 

when f,g,h satisfy (14) for parameters 

(1.6) (t,a, a, a,b, b, b), (t',a’,a’,a’,b',b',b'), (e ql! a a bp! ve b”). 

If in addition f,g,h are all of the first form, where the quantities in (L.5) are all 

negative, then the “équation finale résultant” has degree 


D=ttt" — (t—a)(t = a').4" =a") —G—@).(0 =—a).0" =a") 
=o). —@). —a je G—br =b).' 8) 
+ (¢—B).(Y — B').(e" — BY) + (8). — 8)" — 8”) 
(a+a—b).(t' —b').(t" — b”) — (a’ +a’ — ’).(t — b).(t" — b”) 
en (a" +0" —b").(t 0). — 8) — (ata —).(t! —B).(t" — 8) 
—(a' +a’ —0').(t—b).(t” — B") - (a" +4" — b").(t— ).(t! - 8) 
(@-+a—5).(¢ —b/).(t” — b”) — a’ +a" — 8/).(t—B).(" — b”) 
— (9” +a" b").(é — b).0" — 8°), 


uw 


where we follow the typsetting used in (120.)]. Bézout gives similar formulas 
when the polynomials all have the second form, the third form, etc. 

Here’s a happening from the modern point of view. Assume that the 
parameters of f,g,h € Clu,x,y] are of the first form and that f,g,h are 
generic ie me Newton polytopes. Then: 


e The above formula for D is the mized volume of the Newton polytopes of 
the polynomials f,g,h. We will verify this in Section [I.2] 

e By Bernstein’s Theorem (see Theorem [L.12]in Section [L.2), D is the number 
of solutions of f = g = h = 0 in the torus (C*)?. 

e The minimal generator of the elimination ideal (f,g,h) ™C[u] has degree D. 


This polynomial is Bézout’s “équation finale résultant.” 


It is wonderful to see so much contemporary mathematics packed into an example 
from a book published in 1779. << 


Bézout’s Proofs. In [288], Penchévre gives a careful reading of Théorie générale 
and discusses Bézout’s proofs, most of which are incomplete. But also shows 
how tools from commutative algebra (Koszul complexes) and algebraic geometry 
(toric varieties) can be used to verify many Bézout’s results, including the formula 
for D given in Example 

To give you a hint of what Bézout did, let’s say a bit more about Example[L4] 
If we set a = (a,a,a), then the data (L.6) can be written 


(t,a, b), (t’, Bis b’), (e345 b’’) 


To find the “équation finale résultant,” Bézout uses special polynomials F,G,H so 
that Ff + Gg + Hg lies in the elimination ideal. He picks F to have parameters 
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(T —t,A—a,B-—b) so that Ff has parameters (T,A,B), and similarly for G, H. 
In modern language, this leads to the linear map 


(f.9.h) 


o:C, 8C26C3 > Oo, 


where all possible F’s lie in Cy, etc. Bézout is interested in dim coker ¢, which he 
describes by saying useful coefficients and useless coefficients. Then: 


e The dimension of Cp is the number of lattice points in the polytope for 
parameters (T,A,B), and similarly for C1, C2, C3. 

e This number is a quasi-polynomial in the parameters, with a separate polyno- 
mial for each of the eight forms. Bézout computes these in (92.)—(99.)]. 

e Bézout’s formula for coker ¢ implicitly assumes that the Koszul complex of 
f,g,h is exact, which means that the only cancellations that occur are the 
obvious ones. Bézout explains this by saying “the number of terms cannot 
be made lower by introducing fictitious terms.” (112.)] 


All of this is discussed in detail in [288] Section 6]. There is a lot of math going in 
Bézout’s book! 


Poisson. While a student at the Ecole Polytechnique, Poisson wrote the paper 
Mémoire sur l’élimination dans les équations algébriques, published in 1802. Here 
is how his paper begins: 

The degree of the “équation finale résultant” from the elimination, among a 

number of equations equal to that of the unknowns, of all the unknowns except 

one, cannot be greater than the product of the exponents of these equations; 

and it is precisely equal to this product, when the given equations are the most 

general of their degrees. This important theorem is Bézout’s, but the way he 

proves it is neither direct nor simple; nor is it devoid of any difficulty. 
Poisson’s goal is to prove Bézout’s Theorem in the case of complete polynomials, 
and he observes that Bézout’s bound is sharp in the generic case (“most general 
of their degrees”). The final sentence of the quote is telling—like many of his 
contemporaries, Poisson found Bézout’s book to be difficult reading. This happened 
so often that in 1907, Brill and Noether p. 143] would comment that Bézout’s 
Théorie générale is “ebenso viel beriihmten als wenig gelesen” (“as well known as 
little read”). 

Because of this, Poisson provides his own proof, which proceeds by induction 
on n, the number of equations in the system f; =--: = fy, = 0. The inductive step 
uses the product 

ial fal), 


fi(p)==fn-1(p)=0 
which is now part of the Poisson formula for the multivariable resultant (see, for 
example, Exercise 3.3.8]). For the base case n = 2 of the induction, Poisson 
cites Cramer’s 1750 paper [116], where Cramer uses the product ITs, (p)=0 fo(p). 
Poisson’s paper brings us to the end of our discussion of elimination in the 18th 


century. Readers interested in a fuller picture of what happened in this century 
should consult Penchévre’s article [286]. 


The 19th Century. The century that followed Poisson’s 1802 paper witnessed an 
explosion of work on elimination theory. In 1892, Netto and Le Vavasseur wrote 
a 232 page review article about algebra in the 19th century for l’Encyclopédie des 
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sciences mathématiques pures et appliquées. Their section on elimination theory is 
97 pages long and begins as follows: 


The large number and variety of memoirs relating to elimination makes it difficult 
to classify these memoirs rationally. pp. 73-169] 


A more recent account of elimination theory in the 19th century can be found 
in Penchevre’s 318 page PhD thesis Histoire de la théorie de l’élimination 
Chapters 12-16]. 

There is no way we can do justice to this vast amount of material. We will 
instead focus on a few selected topics, beginning with resultants. 


The Emergence of Resultants. After Poisson, elimination theory had a low 
profile in the first few decades of the 19th century. In 1835, Jacobi used 
elimination theory to prove a result that became the Euler-Jacobi Formula (see 
for a modern account). But starting in 1839, multiple papers on elimination 
theory appeared in rapid succession. 

The Sylvester matrix appeared twice during this period. In 1840, Sylvester 
described a matrix whose rows are built from the coefficients of the polyno- 
mials, suitably shifted by successively adding more zeros. The result is 

a solid square (m+n) terms deep and (m+n) terms broad. 


He takes the determinant of this matrix, though he never says “determinant.” 
The same matrix was discovered independently four years later by Hesse in 
1844 [204]. He begins with polynomials 


Ap = dn 2” +Qn—1 0"! +an_2 0"? +.... +a," +9 


= m m-1 m—2 : 
Bo = bm & wr bm—12 “ay bm—2 le eee bx bo 


(he uses four dots .... rather than the three dots ... commonly used these days). He 
multiplies Ag by 2~!,2™~?,...,2,1 and Bo by 2”~!,a"~?,...,a,1. Expressing 
these in terms of the monomials «”+"—!,2™t"~?,..., 2,1, he obtains the following 


(m+n) x (m+n) matrix: 


x 


bm 
2 [0 | bm |b 
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Like Sylvester, he arranged the coefficients in rows, so this is the transpose of what 
is commonly called the “Sylvester matrix.” We saw an example of this in (1.2), 
though neither Sylvester nor Hesse seem to be aware of what Bézout did in 1764. 
These papers from 1840 and 1844 worked affinely and said “result of elimina- 
tion” rather than “resultant.” But in 1847, Cayley published a paper on elimination 
that begins as follows: 
Designating U,V,W,... homogeneous functions of orders m,n,p,&c. and an 
equal number of variables respectively, and assuming that these functions are 
the most general possible, that is to say that the coefficient of each term is an 
indeterminate letter: we know that the equations U = 0,V =0,W =0,... offera 
relation © = 0, in which the variables no longer enter, and where the function O, 
which can be called the complete Resultant of the equations, is homogeneous and 
of the order np... relative to the coefficients of U, of the order mp... relative 
to those of V, and so on, while it is not decomposable into factors. 
There is a lot going in this quote: 


e Cayley explicitly says “resultant.” 

e He uses homogeneous polynomials. We are no longer affine! 

e He states some basic properties of the resultant (degree in the coefficients of 
each polynomial, irreducibility). 


We have the beginnings of a general theory of resultants in 1847! 

The switch to homogeneous polynomials was driven by two emerging areas of 
mathematics: projective algebraic geometry, where points in projective space are 
described by homogeneous coordinates, and invariant theory, where invariants are 
naturally homogeneous (an example is the determinant, invariant under the action 
of SL(n)). To get a better sense of how the story unfolded, let’s spend some more 
time with Cayley. 


Cayley. In 1845 and 1846, Cayley wrote two papers on invariant theory that 
introduced hyperdeterminants. In the second paper, he observes a link with the 
resultant of two homogeneous polynomials of degree two and comments that 

Important results might be obtained by connecting the theory of hyperdetermi- 

nants with that of elimination, but I have not yet arrived at anything satisfactory 

upon this subject. [70] 

We suspect that Cayley would have enjoyed the 1994 book Discriminants, Resul- 
tants, and Multidimensional Determinants [165|, which devotes an entire chapter 
to hyperdeterminants. 

Cayley was also familiar with Bézout’s 1764 paper. In 1857, he published 
the short paper Note sur la méthode d’élimination de Bezout in Crelle’s journal 
about what he later called “Bezout’s abbreviated process of elimination.” We 
illustrated this in Example[L.3] Cayley and Sylvester wrote Bézout’s name without 
the accent. 

In his note, Cayley begins with homogenous polynomials 


U =(a,...)x,y)” =0 
V=(a,...42,y)" =0, 


1When Cayley published outside of England, he often wrote in French—in the 19th century, 
English was not the dominant mathematical language that it is today. 
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where (a, 6,c,...§a,y)”" =a2"+({)ba" "y+ (5)ca"7y?+--- is how Cayley writes 

a homogeneous polynomial of degree n in x,y. He refers to U and V as “quantics.” 

This is part of the symbolic notation that Cayley developed for invariant theory. 
Then Cayley considers the quotient 


(a,...da,y)"(a’,.. QA, mw)” — (a,..- A, mw)" (a’,.. a, y)” 
px — ry 
which we would write more simply as 
px — Ay 
As noted after Example[1.3] Bézout’s method is encoded in the coefficients of (L8) 
and the determinant of the resulting square matrix is the resultant of U and V. 
Also observe that (1.8) is the homogeneous version of what we did in (1.3). 


A year later in his Fourth Memoir Upon Quantics, Cayley calls (8) the Be- 
zoutic emanant and comments that 


? 


(1.8) 


the result of the elimination is consequently obtained by equating to zero the 
determinant formed with the matrix which enters into the expression of the Be- 
zoutic emanant. In other words, this determinant is the Resultant of the two 
quantics. 
These days, we use the simpler term “Bézoutian,” for which we should be grateful. 
Sylvester also studied Bézout’s work and in 1853 wrote a paper that 
defines Bezoutians, Bezoutics and the Bezoutian matrix. This paper also introduced 
Sylvester double sums, a subject of current interest—see [118]. 
We next discuss a geometric example from Cayley that involves elimination. 


EXAMPLE 1.5. In Example [I] Newton counted tangent lines to a curve that 
go through a given point. In his 1847 paper, Cayley poses the problem of finding 
the equations of these tangent lines: 

Find the equation of the system of tangents drawn from a fixed point to a given 

curve. 

As before, we assume that U(x, y,z) = 0 defines a smooth curve of degree of n in 
P? and that the given point is P = (a, 6,7). In order for a point Q = (€,7,¢) to 
lie on a tangent line going through P, we must satisfy the system of equations 


U=0 

aol pee age ai 
(1.9) Ox Oy Oz 
En ¢ 

det |x y z] =0. 
a p ¥ 


The first two equations tell us that (a, y,z) is a point on the curve whose tangent 
line goes through P, and the third equation says that Q lies on this tangent. 

For this system, Cayley regards the variables as x,y, z, while a, 8, y,&,7,¢ are 
unknown coefficients. He then takes the resultant © of (19), where O is from the 
quote at the top page You should reread this quote before proceeding further. 

The resultant © is a polynomial in a, 3,y7,&,7,¢ and the coefficients of U. He 
notes that (19) has an obvious solution when (qa, 3,7) is on the curve (courtesy of 
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the Euler relation cU, + yU, +2U, = nU), so that U(a, 8,7) is a factor of O. Thus 
O= U(a, 6,7) ‘ ®, 


where ® is the defining equation of the tangent lines when regarded as a polynomial 
in €,7,¢. In Cayley’s terminology, ® is a reduced resultant. <- 


Although the resultant is irreducible when the coefficients of the system are 
independent unknowns, extraneous factors can arise for the resultant of a particular 
system, such as (1.9). We will see some interesting examples of extraneous factors 
in Chapter 

We close our discussion of Cayley with a quote from a paper on elimination he 
wrote in 1864: 

In the problem of elimination, one seeks the relationship that must exist between 

the coefficients of a function or system of functions in order that some particular 

circumstance (or singularity) can occur. 

It is clear that for Cayley, elimination is at the heart of algebraic geometry. 

Before we move on to the general theory of resultants, we need to discuss an 
interesting 1841 paper of Minding that illustrates a different flavor of elimination. 


Minding. While the 19th century focused mostly on resultants of complete poly- 
nomials (in Bézout’s terminology), we have seen that Bézout himself studied more 
general situations. But Minding’s 1841 paper Ueber die Bestimmung des Grades 
einer durch Elimination hervorgehenden Gleichung goes far beyond anything 
Bézout could have imagined. Minding considers a system of (affine) equations 


f(x,y) = Aoy™ + Ary”? +--+ Am-1y + Am =0 
O(x,y) _ Boy” + Byy”* ia Bn—-1y + Bn = 0, 
where “the letters A and B with subscripts stand for arbitrary polynomials in x.” 


The goal is to compute the degree of the “final equation” ~(a) = 0 obtained by 
eliminating y. Following Poisson, Minding expresses the final equation as 


(1.10) w(x) = BY f(x, y1) - f (2, yo): +: f(@, Yn), 


where y1,.-.,Yn are the roots of 6(a,y). Minding expresses each y; as a Puiseux 
series in x about oo, so the exponents are decreasing. For example, 


ym = yl(z) =e +dya™—™ 4+ eya™—% 4---, O< a, <ag<::, 


where hy, a1, Q2,:-: € Q. Doing this for y2,...,yn and focusing on the highest 
power of a on each side of (£10), we obtain 


deg((x)) = deg (Bo(x)™ f(a, e102") --- f(x, enz”)). 


If we set b = deg (Bo(x)) (an integer) and k; = deg (f(x,c;x")) (often a fraction), 
then the above equation simplifies to 


deg(w(x)) = mb+ ky +---+kn, 
which is the major result of [275]. Here is an example from Minding’s paper. 
EXAMPLE 1.6. Suppose that 
f(x,y) = (a? )y* + (a?)y? + (a*)y? + (2° )y + (2°) 
O(a, y) = (a*)y? + (a®)y* + (@®)y? + (w")y? + (a?)y + (x*), 
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where (x*) means a polynomial of degree £. These polynomials have the following 
Newton polygons: 


(2,4) 


The commentary to English translation] explains how the negative reciprocals 
of the slopes of the polygon for @ give 


hy =hp =3, hg =ha =hs —8, 


2: 


This easily leads to 


ky =hko = 4, k3 = kg = ks =5, 
giving deg(q(z)) =4-8+ 44+ 3454+54+5=58. <P 


The commentary to English translation] goes on to show that in general, 
Minding’s formula for deg(~(a)) equals the mixed volume (mixed area in this case) 
of the Newton polygons of f and 6. So Minding’s paper from 1841 provides another 
splendid example of Bernstein’s Theorem. 


The Theory of Resultants. In the latter part of the 19th century, many papers 
on resultants were published, including Brill (1880) [43], Kronecker (1882) 
and Mertens (1886) [271]|272]. This continued into the early part of the 20th 
century with Macaulay (1902) and the books by Netto (1900) and 
Macaulay (1916) [265]. 

These papers and books focus primarily on homogeneous polynomials where 
all monomials of a given degree are allowed. This is the homogeneous analog 
of Bézout’s notion of a complete polynomial. Let fo,--- , fn be homogeneous in 
o,---,%y of degrees do,...,dp. Then the classical or dense resultant, denoted by 
Resao,...,d, (fo;--+; fn), is an integer polynomial in the coefficients of the f; with the 
property that over any algebraically closed field, 


fo =-::=fn = 0 has a 
solution in projective space P”. 


Kronecker, in his 1882 paper Grundztige einer arithmetischen Theorie der al- 
gebraischen Grdssen on the foundations of algebraic geometry, sketched a theory 
of resultants that gave elimination a central role. Here is a quote from his paper: 

A system of arbitrarily many algebraic equations for 3°, 3’,3,3° », in which the 

coefficients belong to the rationality domain ("’, RW”, 9%”, ...), define the algebraic 

relations between 3 and 8%, whose knowledge and representation are the purpose 

of the theory of elimination. 810] 


Kronecker’s notation and terminology are challenging for the modern reader—see 
for a modern account of [244]. But what Kronecker is saying in the quote 
does not differ greatly from the 1864 quote from Cayley given earlier. Kronecker 
never published the details of his approach, leaving it to his student Mertens to 
complete the task. 
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The first book to cover resultants in detail was Volume II of Netto’s Vorlesungen 
tiber Algebra [280], published in 1900. His treatment of resultants includes sections 
on Poisson, Minding, Kronecker, Cayley, Sylvester, and Bézout. He actually read 
Bézout’s Théorie Générale! 

In pp. 146-147], Netto generalizes the matrix described by Sylvester and 
Hesse in the 1840s. Recall that when f(x), g(v) have degree n,m, Hesse multi- 
plied f by 2”~!,...,2,1 and g by x”~!,...,a,1, and expressed these in terms of 
gmtn-l 2,1. This gave the Sylvester matrix whose determinant is Res(f,g). 

Netto studies what happens in general. He works affinely, so we will modify 
his notation to work homogeneously. Consider m+ 1 homogeneous equations 


fa(@o,---;%2m) =90, a=0,...,m 


of degree ng = deg(fa). We assume that fa is the sum of all monomials of degree 
Na With coefficients that are independent variables. This gives the universal version 


of the resultant in degrees no,...,%m, which we denote by 
Res(fo,---; fm) = Res, ae ice LOM edo) € Z (cl. 
Here, c is the vector of coefficients of fo,..., fm, 80 fo,---;fm € Zc, Lo,---, Lm]. 


To build a matrix that captures the resultant, set 
N=No +++: +Nm —- mM 


and multiply fa by all monomials «° of degree n — nq. Then express the products 
x f,, as linear combinations of all monomials of degree n. The coefficients form the 
rows of a matrix M of size © x N, where ¥ is the number of products x? f,, and N 
is the number of monomials z°. Using the definition of n, Netto shows that 


m+1 
n-Na tm nm+m 
en are Cn”) 


When m = 1, this is an equality and M is the Sylvester matrix of fp and f;. 


PROPOSITION 1.7. Let A be an N x N maximal minor of M. Then: 
(1) A is a multiple of Res(fo,..-, fm)- 
(2) AE Zl nr ((fo, vieshim) 2 AB0s + ia). 


PRrooFr. Let Mp denote the N x N submatrix of M with A = det(Mo). By 
construction, 


(14>) Mo bs = 2° fa 


where the column vector on the right consists of those x° f, that give rows of Mp. 
(1) Since Res(fo,..-, fm) is irreducible, it suffices to prove that for any choice 
of complex coefficients €, 


Res(fo,..-,fm)(2) =0 => A(@) = 0. 


This is easy, for by (11), the vanishing of the resultant for ¢ implies that for these 
coefficients, the system has a nontrivial solution  € P™. If we evaluate (L12) at ¢ 
and Z, we get a nonzero element of the kernel of Mo(@) since ¥ € P™. This implies 
that A(¢) = det(Mo(@)) = 0. 
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(2) The classical adjoint M2“! of Mo satisfies M?“) My = Aly and has entries 
in Qc]. If we multiply each side of (L12) by M3, we obtain 
De? E4 foye2i5 fea) 


for all x® of degree n. This implies A € (fo,.--, fm) : (£0,---;%m)”, and then (2) 
follows easily. 


Part (1) of Proposition says that when A is not identically zero, A = 
Res(fo,---, fm) for some extraneous factor E. In 1902, Macaulay showed 
how to find m+ 1 carefully chosen maximal minors Do,..., Dm of M so that 


Res(fo,---; fm) = gcd(Do,..., Dm). 


He also proved that for the D;, the extraneous factor E; is the determinant of an 
appropriately sized minor of D;. The gives the formula 


Res(fo, eva aig Je) = - 
that expresses the resultant as a quotient of two determinants. Thus we have two 
formulas for the resultant, both lovely and both wildly impractical for computations 
due to the large size of the matrices involved. 
Part (2) of the proposition is also important. Netto worked affinely, so his 
version of part (2) can be restated as 


Ae Zc NM (fo,-- viel ai) 


Thus A is in the elimination ideal Z[c] 9 (fo,.--, fm) in the affine setting. Netto 
uses this to prove part (1). 

When we switch to homogeneous polynomials, the usual elimination ideal is 
not helpful, since 

Z{c| N (fo; sey fm) — {O} 

when fo,.--,; fm are homogenous of positive degree in %o,...,2@m. This is where 
the projective elimination ideal Z[c| N ((fo, seby tin) 2 (BOps os ta) enters the 
picture. A key result is that the resultant generates the projective elimination 
ideal of fo,..., fm, ie., 


(1.13) UHes( Jon<ce4 Fea )) = ZI fount ea) js yn) 
For a modern proof, see [226] Proposition 2.3 and (4.1.2)], and for an introduction 
to projective elimination ideals, see Chapter 8, §5]. Once we know (L13), 
part (1) of Proposition [.7] follows immediately from part (2). 

We will encounter projective elimination ideals in Section [I.2] when we discuss 
van der Waerden’s work. 


Final Comments. Algebraic geometry in the 19th century was a vast enterprise. 
The 1892 review article by Brill and Noether is over 450 pages long. There 
was a lot going on besides elimination theory. 

One important topic we have not mentioned is enumerative geometry. Many 
famous results in the field—27 lines on a smooth cubic surface, 3264 conics simul- 
taneously tangent to five general plane conics—date from the 19th century. In one 
sense, the classic version of Bézout’s Theorem is a result in enumerative geometry, 
since it counts (with multiplicity) the number of intersections of n hypersurfaces in 
P” when the number is finite. We saw this in Example[I.1] where Newton counted 
tangent lines using Bézout’s Theorem. But researchers like Schubert, Chasles and 
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others developed powerful geometric methods that had nothing to do with elimi- 
nation. Writing in 1905, Brill comments that elimination theory is 
still outperformed by the well-known geometric counting methods that in the 
middle of the sixties [the 1860s] and ever since have flooded geometry with its 
enumerative results; I mean the characteristic method, the correspondence prin- 
ciple, and more recently the principle of conservation of number. [44] p. 278] 


The last of these, the principle of conservation of number, is important to our story. 
The “Princip von der Erhaltung der Anzahl” plays a central role in Schubert’s 
1879 book Kalkitil der abzahlende Geometrie. He explains the principle as follows: 
In algebraic interpretation, it says nothing else than that changing the constants 
of an equation either leaves the number of their roots unaffected or causes an 
infinite number of roots by making the equation an identity. p. 12} 


Though powerful, the principle of conservation of number lacked a rigorous founda- 
tion. In fact, at the 1900 International Congress of Mathematicians, Hilbert 
posed the following problem: 


15. Rigorous foundation of Schubert’s enumerative calculus 


The problem is to establish rigorously and with an exact determination of 
the limits of their validity those geometrical numbers which Schubert especially 
has determined on the basis of the so-called principle of special position, or con- 
servation of number, by means of the enumerative calculus developed by him. 
Note that Hilbert refers to the principle of conservation of number. But what he 
says next is even more interesting: 
Although the algebra of today guarantees, in principle, the possibility of carrying 
out the processes of elimination, yet for the proof of the theorems of enumerative 
geometry decidedly more is required, namely, the actual carrying out of the 
process of elimination in the case of equations of special form in such a way 
that the degree of the final equations and the number of their solutions may be 
known in advance. 
Hilbert is aware of the potential of elimination theory, but like Brill, he recognizes 
its limitations when applied to enumerative problems. 
We will soon see that in the 20th century, the interaction between elimina- 
tion theory and the principle of conservation of number led to some unexpected 
developments. 


1.2. Elimination Theory in the 20th Century 


After reaching a high point in the early 20th century, resultants and elimination 
theory went through some rough times in the middle part of the century, only to 
re-emerge stronger than ever in recent times. 

A seminal event was the 1890 publication of Hilbert’s great paper Ueber die 
Theorie der algebraischen Formen that can lay claim to being the first pa- 
per in commutative algebra. Besides introducing the Hilbert Basis Theorem, free 
resolutions, and Hilbert polynomials, Hilbert’s proofs also had a non-constructive 
aspect that was new to algebra. This was followed in 1893 with the Nullstellensatz 
in [206]. The ideas planted by Hilbert led directly to the work of Emmy Noether 
in the 1920s that revolutionized algebra. 
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In 1924, the young Bartel van der Waerden spent seven months in Gottingen, 
studying topology with Kneser and algebra with Noether. His first paper on alge- 
braic geometry [850], published in 1926, begins as follows: 

The rigorous foundation of the theory of algebraic varieties ... can be formulated 

more simply than it has been done so far, without the help of elimination theory, 

on the sole basis of field theory and of the general theory of ideals in ring domains. 
The phrase “without the help of elimination theory” tells us that something funda- 
mental has changed. It is likely that this sentence was actually written by Noether 
herself; in [306], we learn that she often wrote the introductions to papers by be- 
ginning students. But as we will see, this phrase represented an aspiration, since 
elimination continued to be an essential tool for many years. 


The Fundamental Theorem of Elimination Theory. This result was stated 
informally by Kronecker in 1882 and proved by Mertens in 1899 using resultants. 
In 1926, van der Waerden found a way to avoid resultants by using the Hilbert 
Basis Theorem and the Nullstellensatz. 

We present the theorem in modern notation. Assume that k is algebraically 


closed and let fi,..., fr € k[ti,...,tm,¥o,---;%n] be homogeneous in 2,...,2n. 
We think of t1,...,t,, as parameters, so that 

(1.14) fi(ti,- . sbi Os see ;La) = fr(ti,- .. bi Os tee fin) = 0 

is a system of homogeneous equations in 20,...,2%,, whose coefficients depend on 


the parameters t1,...,tm. The are two questions to ask about (L14): 


e (Geometric) For which choices of parameters does the system have a 
solution in P”? 

e (Algebraic) What constraints do the equations impose among the pa- 
rameters? 


The link between the two questions is contained the following result: 


THEOREM 1.8 (Fundamental Theorem of Elimination Theory). In the above 
situation, there exist D1,...,Dn € k[ti,...,tm] such that 


(1) Given a point (a1,...,@m) in affine space A”, the system 
filai,- oo pany Bj 25 Bp) SS f(a, - -+,Qm,U0;-- eye) =0 
has a solution in P® <= > Dy(ay,...,@m) =++: = Dn(ai,...,@m) = 0. 


(2) LeteW = V(fi,.--, fr) CA™ x P”. Then in the commutative diagram 
Ww —> A™ x P” 
A™ 
where 7 is projection onto the first factor, we have 


m™(W) = V(Di,. seg) Cc A, 


(3) Dy,...,Dp can be chosen to be generators of the projective elimination ideal 
klty,.. sgibin ia (iiss cagfay : (Xo0,-- ee ae 
In terms of the above questions, the vanishing of D,,...,D), tells us which 


parameters give a system (114) with a solution in P”, and the D; also tell us 
the algebraic constraints that the system imposes on the parameters. Note that a 


18 1. ELIMINATION THEORY 


solution in P” is nontrivial (the x; can’t all vanish), which is why we use the ideal 
quotient (fi,..., fs) : (v0,---,;%m)% rather than the elimination ideal (f1,..., fs). 

The polynomials D;,...,D);, are sometimes called a resultant system since they 
generalize the classical resultant, which by defines ¢(W) in Theorem[L.8]when 
the number of polynomials equals the number of variables. 


PROOF OF THEOREM First observe that (1) and (2) are clearly equivalent. 
We will give three proofs to illustrate how ideas about elimination have evolved. 


First Proof. In 1899, Mertens gave a proof using resultants [272]. We will explain 
how his argument begins. For fi,...,f-, set s = max(deg f;) and consider all 
products x° f; of degree s. Then set 


gj = Uta g@ fe 
for j = 0,...,n and variables u = (u(q,i),;). Mertens denotes the resultant of 
Jo;---,9n with respect to 9,...,U%p by Res(2° ye This is a polynomial in the 


new variables uq,;),;, 80 we can expand the resultant as a sum 


ne 2°28) I> Dalat 


This is sometime called the Kronecker u-resultant. The resultant system con- 
structed by Mertens starts with the polynomials Dg(ti,...,tm), but more need 
to be added since the Dg may vanish identically. Mertens knew that the Dg lie in 
the projective elimination ideal of f,,..., f-, though he didn’t say it this way. 


Second Proof. We will follow van der Waerden’s 1926 proof from [351], with a 
few changes. For simplicity, set t = (t1,...,tm) and a = (a9,...,%n), and let J 
denote the projective elimination ideal from part (3) of the theorem. We leave the 
following two observations to the reader: 


(A) Any g € f vanishes on 7:(W) C A”. Thus 7(W) C V(f). 

(B) To prove that 71(W) = V(J), it suffices to show that for any a € A™ \71(W), 
there is g € I such that g(a) 4 0. 

Now we give van der Waerden’s proof. Like Mertens, he considers all products 

x f; of degree s, except that now s is an arbitrary positive integer, and like Netto’s 

matrix in Proposition[I.7] van der Waerden gets a matrix M(t) with entries in k{f] 

satisfying 


(1.15) Ms(t) Pe = 2 


He calls this the dialytic matrix, a term due to Sylvester. (These days, the name 
Macaulay matrix is more common.) It has ws = (or) columns, and let the w, x ws 
minors of M(t) be 

(1.16) DY 552, DP Oe Ribs. teal: 


If no such minors exist, use 0 instead. An easy modification of the proof of Propo- 
sition [LZ] shows that DY? (t) € f for all j. 
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Following (B), take a = (a1,...,@m) € A™ \71(W). Then the system 
fi(a,z) =--- = f-(a,x) =0 


has no solution in P”. By the Nullstellensatz, (o,...,@n)* C (fi(a,x),..., f-(a,v)) 
for some s, which implies that when we evaluate at t = a, every x? is a linear 
combination of the x° f;(a, x). It follows that M,(a) has rank w,, so that one of the 
minors (L1G) is nonvanishing at a. Since the minors lie in 7, we have a complete 
proof that ™(W) = V(J). 

The final step in the proof is to use the Hilbert Basis Theorem to find finitely 
many generators D,,...,D,, of I. This is the desired resultant system. 
Third Proof. This proof uses scheme theory, created by Grothendieck starting in 
1956 and published with Dieudonné in Eléments de géométrie algébrique [181 
between 1960 and 1967. We will follow Hartshorne’s Algebraic Geometry [190], 
since many students learn the subject from this masterful exposition. The key 
result we need is the following. 


THEOREM 1.9. P? — Spec(Z) is a proper morphism of schemes. 
This is Theorem II.4.9 in [190]. Here, P? = Proj(Z[xo,...,Z,]) is projective space 
over the integers. Also recall that a morphism of schemes f : X — Y is proper if: 


e f is separated and of finite type (see Chapter IJ]). 

e f is universally closed, meaning that for every morphism Y’ —+ Y, the mor- 
phism X xy Y’ — Y’ is closed, i.e., maps closed subsets to closed subsets in 
the Zariski topology. 


The map X xy Y’ > Y’ is part of a Cartesian square 


X xy Y'—3X 


L > | 


y’ ——_> Y 


The proof of Theorem [L.9] uses the valuative criterion that will be explained later 
in the section. 

We now prove part (2) of Theorem[L.8] which says that in the Zariski topology, 
m™1(W) is closed in A”. The latter is really Ay’ = A’ xz Spec(k), and similarly 
Pr =P xz Spec(k). Since W = V(fi,..., fr) C A” x, P® is closed, the proof is 
encapsulated in a single commutative diagram of schemes with a Cartesian square: 


closed 


W—} A™ x, P? ———> P2 


closed closed proper 
image 


Ay ——— > Spec(Z) 


This argument, taken from a 1991 paper of Jouanolou , reflects Grothendieck’s 
philosophy: once things are set up correctly, basic results become obvious. 


At its heart, Theorem [L.8]is an existence theorem. This is reflected in how the 
three proofs show that 7(W) = V(J): 
e In Proofs 1 and 2, the key is to create polynomials in T that precisely define 
m™1(W). Proof 1 does this using resultants, while Proof 2 replaces resultants 
with minors of a matrix of polynomials. 
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e In Proof 3, the valuative criterion guarantees that any point of V(J) is a 
partial solution extends to a solution in W (in the language of ye 


The evolution from Proof 1 to Proof 2 is easy to understand. Van der Waerden 
knew Mertens’s 1899 paper and Macaulay’s 1916 book but also understood the full 
power of what Hilbert did in 1890 and 1893. The introduction to his 1926 paper 
351) notes that the main result is 

an almost immediate consequence of two fundamental theorems of HILBERT, 

which may be called HILBERT’s Nullstellensats and HILBERT’s basis theorem. 

Proof 2 is closely related to the proof used in Ideals, Varieties and Algorithms 
Theorem 6 of Chapter 8, §5]. 

The replacement of resultants with the dialytic matrix M,(t) used in Proof 2 
is also interesting. The key step in the proof was to show that if the f;(a, x) vanish 
only at (0,...,0), then M,(a) has rank ws. Here is how van der Waerden states 
this result: 

If the H-ideal (fi,..., fr) has only the solution (0,...,0) in Q, then the dialtyic 

matrix of the ideal for degree s has rank ws, and conversely. 

(Here, “H-ideal” means homogeneous ideal and 2 is an algebraically closed field.) 
We will see later that replacing resultants with rank conditions on a matrix of 
polynomials is a powerful idea that is still used in current research. 

Going from Proof 2 to Proof 3 represents a substantial change—something 
dramatic happened to algebraic geometry between van der Waerden’s 1926 paper 
and Hartshorne’s 1977 book. As we will soon see, the Principle of Conservation of 
Number played a key role in this change. 


Sul Principio della Conservazione del Numero. This is the title of a paper 
written in 1912 by Severi [318]. Given the concerns about the Principle of Conser- 
vation of Number raised by Hilbert and others, Severi acknowledges that the desire 
for a precise statement of the principle is 

something more than just a scruple about exaggerated rigor. 


In 1971, van der Waerden summarized Severi’s version of the principle as 
follows. We have irreducible projective varieties U,V over the complex numbers 
C and an irreducible subvariety W C U x V such that for a generic € € U, there 
are exactly h points (€,71),..-,(€,7n) € W lying above €. This says that we have 
diagram 


WCUxVYV 


(1.17) a. 
EE€U V3™m,-++5% 
where the map 7 has generic degree h. Take a smooth point « € U and let € ap- 
proach x, where Severi is thinking topologically. We write this informally as € ~ x. 
Now suppose that 7; /(2) is finite. Then Severi claims that € ~+ x extends to 
(€,m) ~ (a,yi:) © W for i = 1,...,h, where the points y1,...,yn, € V have the 
following properties: 


1) No matter how € approaches 2, we always get the same points y1,.-.,yn € V 
gS 
up to order. 
(2) The obvious inclusion {y,..., yn} C 7 1(2) is an equality. 


1.2. ELIMINATION THEORY IN THE 20TH CENTURY 21 


(3) If yw of the n; have the same limit y € V, we say that y has multiplicity uw. Then 
the sum of the multiplicities ps is equal to h. 


This is Severi’s version of the Principle of Conservation of Number. 

To make this rigorous, we need careful definitions of generic point, multiplicity, 
and what € ~» x really means. We also need to address extension: why does € ~ x 
extend to (€,7:) ~» (a,yi)? Van der Waerden tackled all of these questions in a 
series of papers. We will focus on generic points and € ~ x but ignore the more 
substantial topic of multiplicity (interested readers can consult [2871/3061/353] ). 


Generic Points. In their 1915 book on algebraic geometry, Enriques and Chisini 
define a generic point of an irreducible variety as follows: 
The notion of a generic ‘point’ or ‘element’ of a variety, i.e., the distinction 
between properties that pertain in general to the points of a variety and properties 
that only pertain to exceptional points, now takes on a precise meaning for all 
algebraic varieties. 
A property is said to pertain in general to the points of a variety Vn, of 
dimension n, if the points of V, not satisfying it form — inside V, — a variety of 
less than n dimensions. 
In their view, “generic” is a relative concept that depends on the property in ques- 
tion. A point can be generic for one property but not generic for another. This is 
still one of the standard meanings of “generic.” 

In his first 1926 paper [850], van der Waerden proposes an absolute notion of 
“oeneric,” where a generic point lives in an extension of our original field k. For 
simplicity, we assume that k is algebraically closed. Let 


pC R=K[x1,...,2n] 


be a prime ideal. This gives the quotient ring R/p with field of fractions 2. Note 
that k C Q. Geometrically, p defines an affine variety V, and for any extension field 
k C L, we write V(L) to denote the solutions over L, ie., 


VIE) =H 1 Gigs Gy) CL | f(ai,.--,;@n) = 0 for all f € p}. 


Van der Waerden constructs a particular point € = ({,...,&) € V(Q) as follows. 
For each 7, consider the coset €; = x; +p € R/p C Q. Then one sees immediately 
that if f € klai,...,¢p], then 


This shows that € = ({),...,€,) € V(Q). But this is not just any point of V(Q), 
for by (LI8), any f € k[x1,...,v,] that vanishes at the single point € € V(Q) also 
vanishes at every point of V(L) for every k C L. For this reason, van der Waerden 
calls € an “allgemeine Nullstelle” (“general zero”) of p. From this point of view, a 
property is generic for V if and only if it holds at €. 

This notion of generic point is similar to the generic points defined in Weil’s 
1946 book Chapter IV, §1]. It is also related to what happens in scheme 
theory, where the prime ideal p C R = k{a1,..., 2] gives the point 


p € V(p) C Ag = Spec(F). 


In this situation, a polynomial f € R has the value f(p) € k(p) given by the image 
of f in the residue field of Og» », which is simply van der Waerden’s field Q in 
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fancier language. It follows that 


(1.19) Flip) =0 => f ep. 
This is the modern version of (L.18). 


Specialization. For Severi, € ~ a used the usual topology of C. In 1927, van 
der Waerden figured out how to do this algebraically using what he called 
a “relationstreue Spezialisierung” (“relation-preserving specialization” ). Later this 
was shortened to specialization. 

We explore a simple case of specialization that focuses on diagram and 
uses van der Waerden’s generic point € € U(Q). Let © be the algebraic closure of 


Q and suppose that we have 7 € V(Q) such that (€,7) € W(Q). Recall that U and 
V are projective, so that U x V CP™ x P” for some m,n. 

In this situation, we say that a point (x,y) € W(k) is a specialization (in the 
sense of van der Waerden) of (€,7), written (€,7) ~> (a, y), provided that 


(1.20) A(é,n) =0 = > H(2,y) =0 


for all bihomogeneous polynomials H in the coordinate ring k[P™ x P"]. Then we 
have the following existence result. 


PROPOSITION 1.10. With the above setup, take any point x € U(k). Then there 
exists a point y € V(k) such that (x,y) € W(k) and (€,n) ~ (2, y). 


Proor. The ideal of all H € k[P™ x P"] that satisfy H(€,7) = 0 defines a 


subvariety Wo C U x V such that (€,7) € Wo(Q). We have the diagram 
Wo— U x V 


NI 


U 


where V is projective. With a little thought, one sees that Theorem implies 
that 71(Wo) C U is closed. Since 71(€,7) = € is a generic point of U, it follows that 
1™1(Wo) = U, so that x € U lifts to (a, y) € Wo. Then (1.20) follows immediately 
from the definition of Wo. 


Note also that since € is a generic point of U, we have a specialization € ~~ «x if 
and only if « € U—this follows immediately from the projective version of (L18). 
Thus we can restate Proposition [.10]as a result about extending specializations. 


PROPOSITION 1.11 (Extension of Specializations). Given (€,7) as above, any 
specialization € ~ x extends to a specialization (€,n) ~ (x,y). 


Van der Waerden was aware that the proof of Proposition [LIO/L.11] required 
the Fundamental Theorem of Elimination Theory. So the goal stated in his 1926 
paper of finding a purely algebraic foundation for algebraic geometry “without 
the help of elimination theory” remained unfulfilled. Nevertheless, after tackling 
multiplicities in his 1927 paper [352], van der Waerden was able to provide a 
complete proof of Severi’s version of the Principle of Conservation of Number. 

We should also say a few words about what specialization means in scheme 
theory. For concreteness, let R = k[a1,...,2n], & algebraically closed, so that 
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maximal ideals of R correspond to points in k”. These are the closed points of the 
affine scheme Aj = Spec(R). Given points p,q € A?, we have the equivalences 


qef{p} — = pCq 
<=> forall fe R, f(p) =0 = > f(q) = 0, 


where we are using the Zariski topology on A? and f(p) = 0 has the same meaning 
it did in (1.19). In this situation, we say that q is a specialization of p and write 
p~ q. The relation to (1.20) is clear. 


Weil and Elimination. In 1946, Weil published Foundations of Algebraic Ge- 
ometry [359], where he drew on ideas of van der Waerden, Zariski and others to 
create his own unique approach to algebraic geometry. As already mentioned, Weil 
defines a version of “generic point” similar to van der Waerden’s, and the same is 
true for the definition of specialization given in Chapter II, §1]. 

In his book, the title of Chapter H, §2 is “Extension of Specializations.” This 
is where Weil proves his version of Proposition [11] The crucial difference is that 
he does not use elimination theory. Rather, he proves that specializations extend 
by means of a “device” of Chevalley. In a footnote on page 31, Weil says that 
Chevalley’s device, 

it may be hoped, finally eliminates from algebraic geometry the last traces of 

elimination-theory 
This is the only mention of elimination theory in Weil’s book. In his view, it has 
no role in algebraic geometry. 


The Valuative Criterion. In Chapter I of [359], Weil was trying to find the 
essence of what it means to extend a specialization. This process continued with 
Grothendieck, who realized that Zariski’s theory of valuation rings provides the 
right tool to understand the heart of the matter. This led to the valuative criterion, 
which we now explain. 

An integral domain R with field of fractions K is a valuation ring if for every 
a € K, we have either a € R or a! € R. Then R is a local ring with maximal 
ideal m = {a € R| a! ¢ R}. The ring R has distinguished prime ideals {0} 
and m, which are the generic and special points of the affine scheme Spec(R). The 
inclusion R C K gives a morphism Spec(i) + Spec(R) that sends the unique point 
of Spec(fc) to the generic point of Spec(R). A comprehensive account of valuation 
rings appears in Chapter VI]. 

Now suppose that Y is a scheme with points €,y € Y. Then one can prove that 
€~ y if and only if there exists a valuation ring R and a morphism Spec(R) > Y 
that sends the generic and special points of Spec(R) to € and y respectively. 

This allows us to formulate an especially nice case of what it means to extend 
a specialization € ~ y in Y. Suppose that we have a commutative diagram 

Spec(K) —~> X5€' 


(1.21) | ra | 
Spec(R) ——> Yodé,y 


where Spec(R) > Y is as above and €’ € X maps to € € Y. When the dotted 
arrow exists in (L.21) making the whole diagram commutative, the special point of 
Spec(R) maps to « € X giving a specialization €’ ~ a in X that extends € ~» y. 
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The valuative criterion states that a separated morphism of finite type X > Y 
with X Noetherian is proper if and only the dotted arrow exists for every diagram 
(L.2T) where R is a valuation ring. This is proved in Theorem II.4.7]. The 
result is a lovely sequence of ideas: 


e A rigorous foundation for the Principle of Conservation of Number requires 
the ability to extend specializations. 

e This was first proved by the Fundamental Theorem of Elimination Theory, 
which was then eliminated with ideas that led to the valuative criterion for 
properness. 

e The valuative criterion gives a quick proof that P? — Spec(Z) is proper, 
which has the Fundamental Theorem of Elimination Theory as an immediate 
corollary. 


Abhyankar and Elimination. Besides wonderful mathematics, there is also a 
human side to this story. At the 1970 International Congress of Mathematicians, 
van der Waerden gave a talk entitled The Foundation of Algebraic Geometry from 
Severi to André Weil, where he sketched the developments from 1912 to 1946 that 
we have been discussing. His lecture was later published as [353]. 

By 1970, the Grothendieck revolution in algebraic geometry was in full swing. 
One of the people who heard van der Waerden speak at the ICM was Abhyankar, 
a student of Zariski who had a deep affection for 19th century algebraic geometry 
and no love for the extreme abstraction of the Grothendieck school. A poem he 
published in 1971 begins as follows: 

Polynomials and power series 

May they forever rule the world. 


Eliminate, eliminate, eliminate 
Eliminate the eliminators of elimination theory. 


As you must resist the superbourbaki coup 
So must you fight the little bourbakis too. 


Kronecker, Kronecker, Kronecker above all 
Kronecker, Mertens, Macaulay, and Sylvester. 


Not the theology of Hilbert 

But the constructions of Gordan. 
Most of the names are familiar. Abyhankar imagines a conflict between the new 
and the old, making it clear where his sympathies lie. Fortunately, there is no 
conflict—scheme theory has proved to be a powerful, flexible language that has 
enabled algebraic geometry to not only give rigorous proofs of the geometric insights 
of the 19th century but also give rise to whole new areas of inquiry, including the 
minimal model program, arithmetic algebraic geometry, and a host of applications. 


The Rebirth of Elimination. When Weil’s book appeared in 1946, elimination 
theory was at low ebb. New impetus came with the 1965 PhD thesis of Bruno 
Buchberger and his 1970 article that introduced Grobner bases, the Buch- 
berger criterion and Buchberger algorithm. A few years later, Trinks proved the 
following result in a paper published in 1978. If R = t[X1,...,X,], then: 
KOROLLAR. Ist < die lexikographische Ordnung, Ry = t[X1,...,Xv] (0< uv <n) 
und © eine mittels < gebildte G-Basis von A JI KR, so ist 6, := BNR, eine 
G-Basis von Ay := WAAL. 
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This is now known as the Elimination Theorem. Elimination theory is back in 
business after a 20 year hiatus! Furthermore, by the 1970s, researchers had an 
algorithmic basis for elimination, independent of resultants, and computers that 
allowed them to implement these algorithms. This led to the field of computational 
commutative algebra and algebraic geometry, which continues to be an active area 
of research today. 

Besides these developments, other interesting things happened in the 1970s. 
Here are a few highlights: 


e In 1975, Bernstein’s paper The number of roots of a system of equations 
proved a version of Bézout’s Theorem that used mixed volume. 

e In 1975-76, Giraud taught a course on elementary algebraic geometry 
that discussed the Sylvester resultant. 

e In 1976, Abhyankar wrote an impassioned defense of elementary algebra 
(what he called “high-school algebra”) in algebraic geometry [4]. 

e In 1977, Lazard published Algébre linéaire sur K[X,,---, Xn], et élimination 
as part of a project begun in 1974 on commutative algebra. 

e In 1978, Wu published On the decision problem and the mechanization of 
theorem-proving in elementary geometry [864], which drew on the algorith- 
mic tradition in classical Chinese mathematics to create an efficient approach 
to systems of equations. 

e In 1979, Jouanolou wrote Singularités rationelles du résultant [224], which 
applied the full power of modern algebraic geometry to the resultant. 

It is interesting that Lazard, Giraud and Jouanolou were trained in Grothendieck- 
style algebraic geometry yet recognized the value of resultants and elimination 
theory. We will say more about Bernstein’s work below. 


The Rebirth of Resultants. To get a better sense of the status of resultants 
in 1979, let’s look more closely at Jouanolou’s paper |224]. He fixes a Noetherian 


ring & and considers homogenous polynomials fo,..., fn of degree do,...,d, in 
Xo,.-.,Xn whose coefficients are independent variables. Let A = kc], where c is 
the vector of coefficients of fo,..., fn, and set 


AN = Spec(A), W = Proj(A[Xo,.--,; Xn]/(fo,---; fn)): 


In down-to-earth terms, points of AN parametrize all possible choices of the coeffi- 


cients ¢ of fo,.--, fn, and W consists of all pairs (¢,Z) € AN x P? such that 
(1.22) fiep==fien=0 
The resultant Resg,,....4, (fo,---, fn) is a polynomial in Z[c] that vanishes whenever 


the equations have a solution. It follows that we have a commutative diagram 
of schemes: 


W —> AN x pp 


v0 [| 


V(ReSao,...,dn) ——> AV 


Jouanolou proves that the map W — V(Resg,.....a,) is a resolution of singularities 
and uses duality to study where V(Resq,,....a,,) has rational singularities. 

In [224], Jounanolou cites Macaulay’s 1902 paper and 1916 book since in 1979, 
these were the best references available for the properties of resultants. So his next 
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project was to write a series of papers (1980), (1991), (1995), 
(1996) and (1997) that not only created the modern theory of the classical 
resultant but also revealed its intimate connection to commutative algebra. When 
writing the chapter on resultants in Using Algebraic Geometry [105], Jouanolou’s 
paper from 1991 was an invaluable resource. 

One recent contribution is Stagliand’s Macaulay2 package Resultants for 
computing the classical resultant, published in 2018. 


Bernstein’s Theorem. Another important development was the theory of sparse 
polynomials. We have seen scattered examples going back to Bézout, but in the 
19th century much of the focus was on complete polynomials and their homogeneous 
analogs. In the sparse case, we consider Laurent polynomials f € kla7',...,x*+], 
where we now work affinely and allow negative exponents. The exponent vectors 
of f lie in Z”, and their convex hull is a lattice polytope denoted N(f) C R”. 

The algebraic geometry of sparse polynomials was studied by Bernstein, Kho- 
vanskii and Kouchnirenko in the 1970s. A key result is Bernstein’s theorem from 
1975 [25]. Here is a version due to Fulton Section 5.5]: 


THEOREM 1.12. Suppose fi,...,fn € C[xf',...,21] have Newton polytopes 
Q1,---;Qn, and assume that Z” is affinely generated by the exponent vectors of 
fi,---;fn- Then for 


Zaire (CY? | fiG@= =F) =o), 


we have 


SO) Aste) SV Oi 4a): 


z€Z isolated 


Because of the interactions between Bernstein, Khovanskii and Kouchnirenko, 
Theorem [1.12] is sometimes referred to as the BKK bound. A standard refinement 
of Theorem [1.12] is that if the coefficients of the f; are generic, then the solution 
set Z is finite and consists of precisely MV,(Qi,...,Qn) points. See, for example, 
Theorem 6.2.8]. Let’s say a bit more about Bernstein’s theorem: 


e We work in (C*)” because fi,..., fn are Laurent polynomials. 
e An isolated point z € Z has intersection multiplicity 


(23 fiy.- + fn) = dim O(cyn,2/ (fas +++ fn): 


e MV,(Q1,.--,Qn) is the n-dimensional mized volume of Q1,...,Qn. We 
recall the definition in Example below. Note that we use the version 
that satisfies MV,(Q,...,Q) =n! Vol,(Q). See $7.4] for more details. 

e Fulton’s version of Bernstein’s theorem is from his book Introduction to Toric 
Varieties [157]. In Section 5.4, he explains the toric connection by construct- 
ing a projective toric variety Xy such that f,,..., f, correspond to sections 
of line bundles L1,..., 2, on Xy whose intersection product is 


Ly + Lg-++ Ln = MV,,(Qi,..-,Qn). 


This is Bézout’s Theorem for the toric variety Xy. 


In Section [LI] we saw some interesting examples of sparse polynomials from 
the 18th and 19th centuries. Let’s return to a case considered by Bézout. 
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EXAMPLE 1.13. Recall from Example [1.4] that a polynomial f in 2, y, z is of 
the third species if there are positive integers t, a1, @2, a3, 61, b2,b3 such that the 
monomials 21 y*2z*3 of f satisfy 

ky tho tks <t 
(1.24) O<ki<a1, O<ko<a2, O0<k3 <az 
ky tho <b3, ky t+k3< bo, ko +kz < dy. 


Here, we replace Bézout’s awkward a, a,a with a1, a2, a3, and similarly for by, ba, bs. 
The first form of the third species is when t, a1, a2, a3, b1, bz, bs satisfy 


t—b, —bg+a3 <0, t—b,—b3+a2 <0, t—bo-—b3 +a, <0. 


This give a polytope denoted P = P(t, a1, a2, a3, bi, be, bs). Here is a typical picture: 


To compute the Euclidean volume Vols(P) of P, we start with the simplex 
A3 = {(a,b,c) € R® | a,b,c > 0,at+b+e<1} CR’, 


which has volume Vol3(A3) = 1/6. The first line of tells us that P sits inside 
tA3. Then the next line of (1-24) chops off three corners of tAs3: 


The corners, shown in light gray, are translates of (t — a,)As, (t — a2)As and 
(t — a3)A3. It follows that the removing the corners leaves us with the volume 


Ga) Ata 4b aa) 


It remains to consider the third line of (1.24), which removes the three prisms 
shown in darker gray. The reader can verify that the bottom prism has volume 


(t — b3)?(3a, t 3a2 2b3 t) =— a(t b3)? t (ay + G2 bs)(t b3)?. 
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Using similar formulas for the other two prisms, we get 
Vol3(P) = ¢(t? — (t— a1)? — (t — a2)? — (t- a3)? + (t- 01)? 
(1.25) + (t — by)? + (t — bg)? — 3(a, + ag — b3)(t — b3)? 
3(a1 + a3 — b2)(t — bz)” — 3(a2 + a3 — b1)(t — b1)”). 


Now suppose that f,g,h € Cx, y,z] have N(f) = N(g) = N(h) = P and f,g,h 
are generic within all polynomials with this Newton polytope. Then all solutions 
of f =g =h=0 in C lie in (C*)3, and the number solutions is given by 


MV;(P, P, P) = 3! Vols(P) 
= t? — (t—a,)° — (t — aa)? — (t— a3)? + (¢ -— 1)? 
t (t bz)? t (t by)? 3(a1 + ag bs)(t b3)? 
= 3(a1 + a3 be) (t by)? 3(a2 t a3 bi)(t b1)?, 
where the first equality is a standard property of mixed volume and the second 
follows from (1.25). 

Since (1.26) is the number of solutions, this should be the degree of Bézout’s 
“équation finale résultant,” denoted D in Example [4] If you take formula for D 
given in (L.7) and set 
(1.27) t=t=t", a4, =a=a' =a", ag =a=a'=a",..., bj =b=b' =b", 
then D reduces to (1.26) on the nose! 

But it gets better, for suppose now that we have f,g,h with 

N(f) = P = P(t, a1, a2, a3, 1, be, bs) 
N(g) = P: — PE, a4, ay, a3, bi, bb, bs) 
NG) SP PE iy 9g a i as Oe) 


all of the first form of the third species. Then we have the following nice result. 


(1.26) 


PROPOSITION 1.14. Bézout’s formula for D from (L7Z) satisfies the equation 
D = MV3(P, P’, P”). 


ProoF. The mixed volume MV3(P, P’, P”) is constructed by first expanding 
the volume of the Minkowski sum 


(1.28) Vol3(A,P + A\gP' + A3P") 
as a polynomial in ;, 2, A3. Then the mixed volume MV3(P, P’, P”) is defined to 
be the coefficient of A, A2A3. If we set t = (t, a1, a2, a3, 61, b2, bz) and similarly for 
t’ and t”, then 

AiP + A2P’ +A3P" = P(t + A2t’ + A3t”). 

Since P, P’, P” are of the first form, the same is true for the above Minkowski 
sum. So the volume (1.28) is computed using the same polynomial that appears in 
(1.25). Thus MV3(P, P’, P’”) is a polynomial in t,t’, t’”, which we write as 

MV,(P, P’, P”) = MV(t,t’,t”). 
One can check that MV has the following properties: 

e SYMMETRIC: MV(t,t’,t”) = MV(t’,t,t”) = MV(t’,t”,t) =---. 

e MULTILINEAR: MV(t,t’,t”) has degree 1 in each of t,t’, t”. 

e DIAGONAL: MV (t,t, t) = 3! Volg(P(t)). This is the polynomial in (L.26). 
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This means that MV(t,t’,t”) is the blossom of the polynomial in (1.26). We 
recommend for a nice introduction to blossoming. 

But now look at Bézout’s formula for D from (7). It has the same symmetric 
and multilinear properties, and in (1.27), we showed that on the diagonal, it reduces 
to 3! Vols(P(t)). So D is also a blossom of (1.26). Since the symmetric, multilinear 
and diagonal properties uniquely characterize the blossom, the result follows. 


Bézout did not think geometrically about polynomials of the first form of the 
third species, so his derivation of D (described in [288]) uses totally different meth- 
ods. But it is breathtaking to see such a wonderful example of mixed volume in a 
book written in 1779. << 


Sparse Resultants. Just as Bézout’s theorem for complete polynomials led to 
the classical resultant, it should be no surprise that Bézout’s theorem for sparse 
polynomials led to the sparse resultant. Here is the basic idea. For concreteness, we 


will work over the complex numbers. Suppose we have finite sets Ag,...,An C Z”. 
The resulting Laurent polynomials 
(1.29) fil ig22<58n)'= S- Cred). PS Opies t 
ackA; 
give a system of n + 1 equations fo = --- = fy, = 0 in (C*)". Since this space has 


dimension n, the system usually has no solutions. But for special choices of the 
coefficients ¢;,q, solutions can exist. This is what the sparse resultant does. 

For the rest of our discussion of sparse resultants, we will make two simplifying 
assumptions: 

e The convex hulls Q; = Conv(A;) C R” have dimension n for i = 0,...,n. 

e Z” is affinely generated by Ap U---U An, 
The coefficients cia of fo,.--,fn give a point in AN, where N = es |A;| is the 
total number of coefficients. Set 


W = {(Cia,t) € AN x (C*)” | fo(x) = +++ = fn(x) = 0} C AY x (C*)", 


and let Z C AN is the Zariski closure of the projection of W. Here is the basic 
result [165] 8.1]. 


THEOREM 1.15. With the above hypotheses, there is an absolutely irreducible 
polynomial 
Res.4y,...,An € Z|Ci,a | O<i<n,ae Ail 


with relatively prime coefficients such that Z = V(Res,,,....4,). Furthermore, for 
a fixed i, the polynomial Res 4,,....4,, 18 homogeneous in the coefficients c;,q of fi of 


degree MV,,(Qo,...- Oe ..+;Qn)- 


It follows that we have a commutative diagram 
Ww ——->+ AN x (C*)” 
(1.30) | | 
V(Res4o,...,An,) ——> A% 


Res.4),...,A, is sometimes called the mized sparse resultant (since each polynomial 
is allowed to have different support). 
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EXAMPLE 1.16. The classical resultant Resq,,...., is the special case of the 
sparse resultant where A; = (d;A,)MZ” and A,, is the standard n-simplex: 


An = {(a1,---,@n) € R” | a1,...,4n) > 0,a, +--+ +a, < 1}. 
Then (1.30) is a version of diagram (1.23) that uses (C*)” in place of P”. << 


EXAMPLE 1.17. Let A = A; =--: = A, C Z", where Q = Conv(A) C R” 
has dimension n and Z” is affinely generated by A. Then Resy),....4, (fo,---; fn) 
is often written Res4(fo,..-, fn), called the sparse A-resultant or unmixed sparse 
resultant (since all polynomials have the same support). 

The lattice polytope Q gives a toric variety Xg with torus (C*)”, and fo,.-., fn 
can be homogenized with respect to the total coordinate ring of Xg to give poly- 
nomials Fo,..., Fn (see Section 13]). Then the equations Fy = --- = F, = 0 
define W Cc AN x Xqg, where as above the coefficients of the F; live in A. In this 
situation, the diagram becomes 


W —_> AN x Xg 


| | 


V(Res4) ——-+ AX 


In (£.30), the resultant vanishes when the equations have a solution in (C*)", while 
in this diagram, the resultant vanishes if and only if the equations have a solution 
in Xg. There is a deep link between sparse resultants and toric varieties. << 


Sparse resultants are discussed in detail in Discriminants, Resultants, and Mul- 
tidimensional Determinants [165], published in 1994 by Gel’fand, Kapranov and 
Zelevinski. This astonishing book contains a wealth of material, including the im- 
portant concept of discriminant that we have completely ignored. 

We have now arrived at the end of the 20th century, with many omissions. But 
what about applications? We will give one example that describes what happened 
when geometric modeling first became aware of elimination theory. 


Geometric Modeling. In the early 1980s, a PhD student in mechanical engineer- 
ing named Tom Sederberg read the book Computational Geometry for Design and 
Manufacture, published in 1979. When discussing the intersection of parametrized 
surfaces in R*, the authors write: 
Great advantage would be had if one surface could be translated into implicit 
form. Unfortunately, the most commonly used parametric surfaces, the bicubic 
patches, cannot be expressed this way. p. 258] 
Parametrized surfaces will be covered in Chapter[3] so here, we will say just enough 
to understand the quote. A bicubic patch in R® is a surface parametrization 


3 3 3 
(1.31) = sc age, y= S- bys, 2= S- Geet, 
i,j=0 i,j=0 i,j=0 
and its implicit form is the equation F(x, y, z) = 0 of the surface. This is clearly 
a problem in elimination theory, which according to [146] can’t be done! As the 
first book on geometric modeling, reflected what the newly emerging field of 
geometric modeling knew about algebra at the time. 
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Sederberg was intrigued by what he read in and thought this might be 
a good topic to study. Being a mechanical engineer, he did not find the algebraic 
geometry of the 1980s to be helpful. Instead, Sederberg turned to the older lit- 
erature on elimination, including a 1908 paper of Dixon entitled The eliminant of 
three quantics in two independent variables [133]. In this paper, Dixon writes: 

I now go on to the discussion of the eliminant of three quantics of degree m+n, 

in which the highest power of x and y respectively are x” and y™, each quantic 

being of the form 


SOA ye OS 012 S01, Qian mi): 89] 


The Newton polytope is the rectangle nA; x mA, which for the polynomials (E31) 
is the square 


Hence we are dealing with sparse elimination. In 1984, Sederberg, Anderson and 
Goldman published Implicit Representation of Parametric Curves and Surfaces, 
where the introduction notes that 

... the general theory of elimination is a lost art, with obscure references. 


The goal of their paper is to use classic ideas of Sylvester, Cayley and Dixon to 
implicitize certain parametric curves and surfaces. 
For a curve parametrization 


a(t)  _ O(t) 

~ eft)? 2 et)’ 

where a, b,c are relatively prime polynomials of degree n, the authors explain how 
the Sylvester formula for the classical resultant gives the implicit equation 


(1.32) F (x,y) = Res;(c(t)a — a(t), c(t)y — b(t)) = 0 


when the parametrization is generically injective. 

For surfaces, they turn to Dixon’s paper, where the goal is to find the “eliminant 
of quantics ¢,, x.” When ¢,w, x have Newton polygon nA, x mA, this is the 
sparse A-resultant Res4(¢, wv, x) for A = (nA; x mA,)NZ?. In [133], Dixon gives 
three methods to express this resultant as a determinant. 

For a bilinear surface parametrization 


© =aj9+a1,8+ aot + agst 
y = bo + bys + bot + bgst 
Z=Co+ce1$+ cot + cgst, 


we move 2,y,z to the right hand side of each equation, and then Dixon’s third 
method leads to the implicit equation 


ag — x ay az a3 O uy 
bo — y by by 63 0 0 
_ Co—Z2 Cl c2 cz O 0 = 
F(a,y, z) = det 0 ao — 2x 0 ag a, a3 7 0, 
0 b-y O bz by bg 
0 co-z 0 G@ | 
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which is quadratic in x,y,z. This determinant appears in 5.1]. In general, 
Dixon’s method gives a 6mn x 6mn determinant—see [105) Exercise 7.2.10]. 

The paper also uses Dixon’s first method for the resultant, which gives a 
2mn x 2mn determinant based on a variant of the Bézoutian from Section[L.1] The 
entries of the matrix are linear in x,y,z, so that the implicit equation has degree 
2mn. In the bicubic case, n = m = 3 gives a surface of degree 18. This number 
was expected to be 18 because of ray tracing, but was the first geometric 
modeling paper to give a general formula for the degree of the surface. We will 
discuss a modern version of the degree formula in Section B.2] All of the resultants 
appearing in are given by determinants. This is rare, since most determinants 
giving resultants have extraneous factors, similar to Proposition[L.7] In the unmixed 
case, Sylvester-style representations (e.g., Dixon’s third method) are classified in 
Proposition 13.2.2], and shows that each of these has a Bézout-style 
representation (e.g., Dixon’s first method). 


A Preview of the 21st Century. To give a hint of what has happened in 
geometric modeling since 1984, we end with a theorem from a 2016 paper of Botbol 
and Dickenstein [41]. Suppose that we have a parametrization 


c= fils, t) y= fa(s, t) = f3(s, t) 

fo(s,t)’ fo(s,t)’ fo(s,t) 
and want to find the implicit equation. Our only assumption on fo, fi, fa, fg is that 
they are relatively prime. 

One approach would be to use a sparse resultant based on the exponent vectors 
of the f;. But sparse resultants are not easy to compute. So instead, we replace the 
resultant with a matrix of polynomials similar to what we did in Proposition[L.7Jand 
Theorem But those matrices were built from the coefficients of the polynomials, 
while here, we will use syzygies. Recall that a syzygy on fo, fi, fo, fg consists of 
polynomials ho, hy, hz, hg satisfying 


(1.34) hofo + hifi + hafe + ha fs = 0. 


To reduce to a finite set of syzygies, let P C R29 be a lattice polygon that contains 
the Newton polytopes of fo, fi, f2, fs. The set of all syzygies where the 
Newton polytopes of the h; are contained in 2P is a finite dimensional vector space 
that can be computed by linear algebra. 

A basis of such syzygies gives polynomials 


LY = ho(s,t)Ow + nO (s, a + AY (s,t)y + nO (s,t)z, 1=1,...,N, 


where w, x,y,z are homogeneous coordinates on P?. Now rewrite each polynomial 
in the form 


(1.33) 


co= ‘> dai, L,Y, 2) s't? 
(i,j)€2PNZ? 


for linear forms 6). (w, a, Y,2). When we arrange the ee , into a matrix M of 


linear forms, we have the following result from ‘ 


THEOREM 1.18. In nice cases, the rank of the matriz M drops precisely when 
evaluated at the points in the closure in P? of the surface parametrized by (33). 


There are a lot questions to ask here. What do we mean by “nice cases”? And 
why syzygies? All will be answered in Section 3.2] An especially intriguing feature 
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of Theorem [L.18]is the replacement of resultants with rank conditions on a matrix, 
an idea used by van der Waerden in 1926. But Theorem [1.18] is only one of many 
things happening in 21st century elimination theory, as you will soon see. 


1.3. Elimination Theory in the 21st Century 
by Carlos D’Andrea 


Among the thousands of challenges that this century poses to science we can 
mention: 


e The average citizen should understand (and get excited by!) the latest results in 
science, which explains the increasing effort towards popularization of science 
and scientists done by the media lately, jointly with a more active involvement 
of the scientific community in this task. 

e Research should be more interdisciplinary, this trend probably being pushed 
by the fact that more and more funds are available in the applied-close-to-the- 
real-world problems, making all of us shift our gears in that direction. 

e Computations should be fast, as we have increased enormously our computa- 
tional capacities in the last decades, and still hope for improving them in the 
next years to come. 


These challenges have definitely impacted all areas of mathematics, particularly 
those of a computational nature. For instance, in the last decades we witnessed 
the birth and growth of a large number of software packages in computational 
commutative algebra and algebraic geometry [I][20\/121|/1771[255). 

Elimination theory was not exempt from this trend, and the last few years saw 
us immersed in both bringing interesting (cool!) problems from chemistry, medicine, 
biology to our research as well as proposing new theoretical models and theorems 
coming from these disciplines, see for instance [836]. 

When it comes to fast computation, the “cost” (complexity) of elimination has 
been already studied all along the last century, going well back to the beginning of 
the 1900s with the work of Emmy Noether. Unfortunately, it is well known (see 
for instance for a recent treatment of the topic) that this cost is at least 
exponential in the standard model of computation, and can get worse. 

As problems coming from the real world have to be modelled and solved in a 
reasonable time frame, this bad result from theoretical complexity led us to consider 
changing the “grand picture” of chasing universal solvers for all kinds of problems 
in elimination theory to either relax the input (and expect the output to be relaxed 
too!), or change the computational model to put the relaxation there. 

The 21st century is still unfolding, so as a disclaimer, we are not going to point 
in a general (or several) direction(s) and make prophecies on where elimination 
theory will be 80 years from now, because with very high probability we would fail. 
Instead, we are going to highlight directions that have been explored successfully 
already within this century, and within each category, we will choose one or two 
papers as sampling of how things work and progress in these areas. So, instead of 
having a full meal, what you will find here is a selection of some kind of Spanish 
“tapas” into the 21st century. We hope you enjoy them. 
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Relaxing the Input. We will explore several ways of doing this. 


Tropical Geometry. A very easy way of getting rid of “exponential” objects is 
by computing their logarithms. This idea lies at the core of tropical geometry: 
let V C (C*)” a variety. Its “amoeba” is defined as the image of V under the 
logarithmic map Log, : (C*)”" > R”, Log,(z1,.-.,2n) = (log; |z1|,.--, log: |znl), 
where ¢t € Ryo. In principle, the image of V under this map can become way more 
complicated to describe than V itself (for instance, if V is analytic or algebraic, its 
image is not necessarily an object of the same category). 

However, as t + +00, amoebas get contracted around some asymptotic direc- 
tions which will define “the skeleton” of the tropical variety of V. The limit will 
be a union of pieces of linear varieties which still contain important information 
from the original variety, and are easier and fast to compute, which is what we are 
looking for. Tropical geometry takes its name from “tropical algebra”, a classical 
construction which already existed in 20th century mathematics, which now comes 
into the picture as some kind of “base semi-ring” of tropical varieties. The reader 
will find in and the references therein an introduction and state of the art 
of this fabulous and very active area of mathematics, which now has its own code 
14Txx in the Mathematics Subject Classification of the AMS. 

It should not be surprising that also in tropical geometry there is a notion 
of “elimination”, and that it is of course connected to the “tropicalization” of 
classical elimination theory. Our first “tapa” consists essentially of displaying such 
aresult. Indeed, in Bernd Sturmfels and Jenia Tevelev state the “fundamental 
theorem” on morphisms of tropical varieties. Their main result is the following: 


THEOREM 1.19 (Theorem 1.1 in [338]). Let a: T” — T? be a homomorphism 
of tori and let A : Z” —> Z% be the corresponding linear map of lattices of one- 
parameter subgroups. Suppose that a induces a generically finite morphism from 
X onto a(X). Then A induces a generically finite map of tropical varieties from 
trop(X) onto trop(a(X)). 


Some words to explain the language and notation of this statement: the torus 
T” here is essentially (C*)”, and X is an algebraic subvariety of T”. It is known that 
(see for instance [106]) any homomorphism of tori as above is encoded by a linear 
map, in this case the map is denoted by A. The operator trop in the statement 
of Theorem [L.19]}is the “tropicalization” of a variety which we have sketched when 
we introduced the concept of a tropical variety: replacing the variety by this union 
of pieces of linear varieties jointly with some less subtle structure. This theorem 
states that “tropicalizing” the image of a variety under a homomorphism of tori is 
essentially the same as tropicalizing the variety first, and then applying the linear 
map A. So the algebraic map a gets “simplified” to the linear map A. 

Apart from being of interest on its own, this result has been used to extend 
and simplify the tropicalization of classical objects in elimination theory such as 
resultants. This work has been tackled by Anders Jensen and Josephine Yu in 
220), which is our second “tapa” in this area of elimination and tropical geometry. 
We reproduce here the abstract of this paper: 


We fix the supports A = (Aj,..., Ax) of a list of tropical polynomials and define 
the tropical resultant TR(A) to be the set of choices of coefficients such that 
the tropical polynomials have a common solution. We prove that TR(A) is the 


tropicalization of the algebraic variety of solvable systems and that its dimension 
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can be computed in polynomial time. The tropical resultant inherits a fan struc- 
ture from the secondary fan of the Cayley configuration of A, and we present 
algorithms for the transversal of 7R(A) in this structure. We also present a new 
algorithm for recovering a Newton polytope from the support of its tropical hy- 
persurface. We use this to compute the Newton polytope of the sparse resultant 
polynomial in the case when J R(.A) is of codimension 1. Finally we consider the 
more general setting of specialized tropical resultants and report on experiments 
with our implementations. 

In this excerpt we find phrases like “tropical polynomials” which we have not 
defined here (and we will not), but essentially what we learn by reading the descrip- 
tion of the main results in is that the “tropical resultant” acts as a resultant 
of these tropical polynomials, and turns out to be the tropicalization of the classical 
resultant. This is of interest because it shows that one can compute these tropical 
resultants faster. 

What about discriminants? These objects are also at the heart of elimina- 
tion theory, with very mysterious properties which still have to be understood and 
explored. Their computation is still a challenge even for the simplest cases. But 
tropical geometry has can be of help for this task. Indeed, in the “tapa” [130], Eva- 
Maria Feichtner, Alicia Dickenstein and Bernd Sturmfels succeeded in explaining 
what the tropicalization of the discriminant is. Their main result is the following: 


THEOREM 1.20 (Theorem 1.1 in [130]). For any matrix A, the tropical A- 
discriminant trop(X%,) equals the Minkowski sum of the co-Bergman fan B*(A) 
and the row space of A. 


To make this statement more precise (background and proofs of course can be 
consulted in [130]), any discriminant can be codified by a matrix with integer entries 
(in the case of Theorem [L.20]it is the matrix A). To any such integer matrix A one 
can associate a projective toric variety X 4, and the dual of this variety, denoted 
by X%, is the one defining the discriminant associated to this data. The map trop 
is the same as above, the one which “tropicalizes”. Finally, the co-Bergman fan 
B*(A) is just the tropicalization of the kernel of A. So, Theorem [1.20] essentially 
states that the tropicalization of a very complicated object (the A-discriminant) 
can be easily computed as the Minkowski sum of the row space of the matrix A 
and the tropicalization of its kernel. With this result in hand, the authors give in 
the same paper (Theorem 1.2) an explicit description of the initial monomials of 
the A-discriminant, shedding light into a problem that seemed hard to approach 
for decades. 


“Tailored” Resultants. The main reason why computing resultants is a hard prob- 
lem from a complexity point of view is because the number of monomials of a given 
degree is “big”, in the sense that it grows exponentially as the number of variables 
or the degree increases. It is even worse, the number of solutions of a “generic” 
system of n polynomials in n unknowns with fixed degree is also exponential with 
respect to the degree, and resultants codify somehow these solutions. In an at- 
tempt to avoid this situation, different kinds of “resultants” appeared in the last 
decades, all of them essentially associated to a projection of an incidence variety 
as in from Section [1.2] but replacing the projective space P” with another 
compact variety, and hence having the Elimination Theorem be valid for this kind 
of variety, hopefully making the new resultant easier to compute. This approach 
has been very fruitful both from a theoretical point of view as well for applications, 
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and is still being explored these days. We will only present few “tapas” in this 
section, as there are several important and interesting results in a lot of directions. 
The reader will find the references at the beginning of each title below. 

Among the results obtained here we can mention: 


e Reduced resultants: Even though this operator is quite old (from the previous 
century), it became popular after being introduced by Oscar Zariski in 
to prove some properties of multiplicities of points, and then treated more 
systematically by Jean-Pierre Jouanolou in Section 5.3.29]. This is the 
case when all the input polynomials are generic with the property of “having 
already a common zero”. The variety P” is replaced by the blow-up of the same 
projective space at a point. This common zero can also be considered as “fat”. 
Then the blow-up would be with respect to this multiple point (the defining 
ideal of the fat point). 


e Anisotropic resultants [225][227|[228]: These are resultants of “weighted” ho- 
mogeneous polynomials, i.e., instead of declaring that the degree of the mono- 
mial a; is equal to 1, one sets deg(x;) = d; for positive integers d1,...,dn41. 
The role of projective space P” is now replaced by the “weighted projective 
space” (as defined in [106]) P(do,...,d). 


e Sparse resultants [120)[165]/231|/335|: We encountered these in Section [1.2] 
The variety replacing P” is a toric variety (or a toric cycle, see [120] ) associated 
to the monomials appearing in the expansion of the input polynomials. This 
is why the name “sparse” arises, as the degree or the number of variables may 
be high, but only few monomials may appear in each of the polynomials. The 
complexity of elimination has been shown to be directly related to the number 
of monomials appearing in the input as well as the number of solutions of 
generic sparse systems which only use these monomials, and hence the size of 
these resultants is (at least theoretically) smaller than expected. 


e Parametric resultants [62]: This resultant can be regarded as a generaliza- 
tion of sparse elimination, instead of considering a polynomial as a “sum of 
monomials” we will consider f; = pee cig; (a), with {9;(x)}i<j<n being a 
predetermined family of polynomials. In the classical and sparse case this fam- 
ily would just encode the monomials appearing in the expansion of each fj. 
Under suitable conditions on this family, it can be shown that a variety which 
is parameterized by the g;(x)’s can be used to replace the classical projective 
space in the elimination scheme. 


e Residual resultants [52)(63]: This case is similar to the parametric one just de- 
scribed, but instead we impose suitable conditions on the family {g; (x) }i<j<n, 
for instance to have a “nice” set of zeroes, a complete intersection. In this case, 
it is a generalization of the reduced resultants described above when there is a 
common (non-multiple) zero of the input system.The variety replacing the clas- 
sical projective space in the elimination scheme is now replaced by the blow-up 
of this space along the zeroes of this family. 


Of course there are other resultants which have appeared in the picture in the 
last years: determinantal ([53]), developed ([234]), equivariant ((65]),... This is 
a very fast moving area with plenty of room for implementation, discovery and 
excitement. 
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Implicitization Matrices. Manipulating matrices is at the core of elimination the- 
ory: matrices appear when we want to make explicit some (linear) phenomenon, 
and the classical connection with matrices and commutative algebra was done via 
determinants, the most important object in elimination theory. Determinants or 
minors appear most of the time to measure the failure of a given system to be 
as “generic” as it was supposed to be. From the very beginning of the history of 
elimination theory, determinants have been used to come back from the “matrix 
kingdom” to commutative algebra, and a lot of progress in this area was done to- 
wards the goal of finding nice determinantal formulze for computing resultants and 
eliminants, to give an example of this interaction. 

This approach, however, underwent an important change in the last years with 
the surge of numerical linear algebra. Now we can compute ranks and lots of other 
invariants of huge matrices even with approximated numerical input with very little 
effort. So these ideas should be adapted and used in computational algebra too, 
particularly in elimination theory. We saw a glimpse of this in the Preview of the 
21st Century in Section 

The tapa chosen to illustrate this topic is the paper by Laurent Busé [55]. Let 
us illustrate it with the following example: consider the (affine) parametrization of 
the twisted cubic in affine space A, t + (t,¢?,¢°). If we denote by (x,y,z) the 
coordinates of this space, it is well known that the implicit equations of the image 
are given by the maximal minors of the 2 x 3 matrix 


(1.35) M(z,y,2z) = |: ° 4). 


This implies the following straightforward consequence: a point with coordinates 
(p1,P2,p3) € A® lies in the twisted cubic if and only if rank(M(pj,p2,p3)) < 1. 
Matrices like (1.35) are an example of what in are called “implicitization ma- 
trices”. They should be “easy” to build (for instance, the entries should be linear), 
and also easy to evaluate. Moreover, they should be sensitive to numerical data, 
i.e., when dealing with approximate data, the ansatz should be sensitive to it. But 
the main property of these implicitization matrices is the fact that they drop rank 
after being evaluated if and only if the point of evaluation belongs to the image 
of the parameterized variety. The new “philosophy” here is the following: instead 
of considering all the maximal minors of M(z, y, z) as the generators of the curve, 
just take the matrix itself to codify that information! It is easier to codify, and also 
easy to check for instance membership to the curve. 

In (see also [56]) there are methods to compute implicitization matrices 
for any rational map A” --» A”. These matrices are the result of computing a 
kernel of some kind of “Sylvester map”, and as a result they end up being linear, as 
the example above. It should be noted that these implicitization matrices are not 
a generalization of the structure of M(a, y, z) above, which essentially states some- 
thing stronger, namely the fact that the twisted cubic is a determinantal variety. 
For example, applying the methods in to the parametrization of the twisted 
cubic, one obtains matrices like the following: 


E+yt2z £+y x 
z—1 y-1l gz-1)’ 
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or 


2e+2y -—2e—y 2w+2y4 2z 3x0 — 2y— z x 2x 
y—-1 1 z—1 3/2 1/2 a«-1 
0 y-1 0 z= xz—1 0 
And here is what an implicitization matrix of the parametrization of the unit sphere 
in A® looks like: 


y ytz-ar24+1 say l 0 
y 2y — 2 “x-ytez-l1l -y 
y—-x-1 —22¢ 22 z 


A key feature of these implicitization matrices is the fact that all they are actually 
more a representation of the parameterization itself, and not only of its image (in 
this case, a curve in 3-dimensional space). So, they not only give us a way to test 
for “membership” but also provide with a tool to compute inverse images of the 
parametrization at a given point in the image. 

One may consider implicitization matrices as a generalization of the use of 
resultant matrices to solve overdetermined polynomial systems (again an inversion 
problem). This can be done because resultant matrices are built from syzygies, the 
trivial ones, the Koszul syzygies. In general, looking for implicitization matrices 
demands studying the syzygies of the input. 

Another key feature of representation matrices is: their entries are linear in the 
coordinates of the target space, and testing rank should be fast when dealing with 
specialized coefficients. Moreover, if the ground field is R (“the” ground field for 
graphics and computer aided geometric design) one can use classical methods of 
numerical analysis to present these matrices in a more canonical and easier form 
(like the use of singular value decomposition). For all this, the use of implicitization 
matrices has been shown to be well suited for a lot of problems in graphics and 
computer aided geometric design. Operations like the study of birationality of a 
map, computing inverses, and intersections of varieties encoded this way can be 
done with the aid of these tools in a very efficient way. 

See Section [3.2] for further discussion of implicitization matrices. 


Syzygies and Rees Algebras. These topics display a very nice and interesting in- 
teraction between algebraic geometers and the geometric modelers introduced in 
Section [1.2] In the 1990s, a series of experiments made by Tom Sederberg and 
his collaborators showed something interesting: one can compute the defining poly- 
nomial of a parametric rational curve as a determinant of a matrix whose entries 
are coefficients of what they called moving curves that follow the parametrization. 
In algebraic language, these moving curves are polynomial relations of the input 
polynomials defining the parametrization. The intriguing feature is that the more 
singular the curve is (i.e., the more singular points it has), the smaller the size of 
the determinant. For instance, the following result appears in [314]: 


THEOREM 1.21. The implicit equation of a quartic curve with no base points 
can be written as a 2x 2 determinant. If the curve doesn’t have a triple point, then 
each element of the determinant is quadratic. Otherwise, one row is linear and one 
row is cubic. 


To illustrate this result, we consider the following examples. 
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EXAMPLE 1.22. Set a(s,t) = s* — t*, b(s,t) = —s7t?, c(s,t) = st?. These 
polynomials define a parametrization ® : P! —+ P?, with implicit equation given by 
F(a,y,2) = z+ — y* — xyz. Here is an affine picture of the curve over the reals 
when « = 1: 


0.5 


0.0 
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From the shape of F(a,y,z), it is easy to show that (1,0,0) is a point of 
multiplicity 3 of this curve, and it is also easy to verify that 


Lan s.t) 259, z) = 84% + ty 


is a “moving line” (a syzygy of (a,b,c)) which follows ©. The reader will now 
easily check that the following moving curve of bidegree (3, 1) also follows ®, where 
“following” here means that when we replace x, y, z with a, b,c, the expression below 
vanishes identically: 
L1,3(s,t, x,y,z) = s(y® + 227) +tz*. 

Note that £1,3(s,t, x, y, z) can be regarded as a syzygy on a?, ac”, c?, elements of the 
cube of the ideal generated by the input parametrization. The 2 x 2 matrix claimed 
in Theorem for this case is made with the coefficients of both £1,1(s, t, 2, y, z) 
and £1.3(s,t, x,y,z) as polynomials in s, t: 


z y 
(1.36) F er 5] <P 


EXAMPLE 1.23. We reproduce here Example 2.7 in [96]. Consider 
a(s,t) = s*, b(s,t) = 6s*t? — 41+, e(s,4) = 4s°t — 4st®. 


This input defines a quartic curve with three nodes, with implicit equation given 
by F(x, y,z) = 24 + 4ay? + Qeyz” — 162? y? — 6272? + 16x%y. 
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The following two moving conics of degree 1 in s,t follow the parametrization: 
Lio(s,t,2,y,2) = s(yz—axz)+t(—2? — 2ry + 427) 
Lia(s,ts,z,y,z) = s(y?4 2? Qry) + t1 (xz — yz). 

As in the previous example, the 2 x 2 matrix of the coefficients of these moving 
conics 


Z-—2Z —2? — Qey + 4a? 
(1.37) <i iid 
yo + 32° — 2ry LZ — YZ 
is the matrix claimed in Theorem [1.21] < 


It is known in general that one can compute the implicit equation of a curve 
of degree d from the determinant of a d x d matrix containing the coefficients of 
syzygies (“moving lines”) of the input data. This matrix can be built from the 
Bézout matrix of the parametrization (see [61] Proposition 6.1] for details on this 
construction). For d = 4, this means that the implicit equation of a quartic can be 
computed from the determinant of a 4 x 4 matrix. Theorem [L2I] above essentially 
shrinks the size of the matrix in half. Is there an analogue of this result for higher 
values of d? 

The method of implicitization by using moving curves and surfaces introduced 
by Sederberg and his collaborators in and subsequent articles 
essentially explored this question, and lots of partial results, especially using moving 
conics and quadrics, were obtained at the time. Among them we can mention 
(60) /7'7|9.4\100)[108). 

The main breakthrough in this area was provided by David Cox, who showed 
at the beginning of this century in the tapa [96], that a good way to understand 
all these moving lines, conics, cubics, and their interactions, is by taking a look at 
the kernel K of the following morphism of rings: 


k[s,t, 0, y,2| — ks, t, €] 
(1.38) s,tr>s,t 
L,Y, Z +--+ LE, ye, Ze. 


Here, & is the ground field where the parametrization is defined, and e a new 
variable. Indeed, it is shown in Nice Fact 2.4] (see also for the case 
when ® is not generically injective) that all the moving curves that follow the 
parametrization ® are bihomogeneous elements of K, and they generate this ideal. 

Let us say some words about the map (138). Denote with J C kjs,¢] the 
ideal generated by a(s,t), b(s,t), c(s,t). The image of is actually isomorphic 
to k[s,t][eZ], which is the Rees algebra of I. By the Isomorphism Theorem, we 
then get that k[s,t,x,y,z]/K is isomorphic to the Rees algebra of I. This is why 
the generators of K are called the defining equations of the Rees algebra of J. 
The Rees algebra that appears in the method of moving curves corresponds to the 
blow-up of V(J), the variety defined by I. Geometrically, it is just the blow-up 
of the empty space (the effect of this blow-up introduces only torsion), but this 
geometrical perspective should explain somehow why moving curves are sensitive 
to the presence of complicated singularities. As of today it is not well understood 
why the description of K actually gets much simpler if the singularities of the curve 
are more entangled. 

Indeed, it has been shown in [54], by unravelling some duality theory, that 
for any proper parametrization of a curve of degree d having a moving line of 
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degree 2 and only cusps as singular points, the kernel K has (aria 4) +5 minimal 
generators. On the other hand, in joint work with Teresa Cortadellas (see 
also [247]), we have shown that if there is a moving line of degree 2 and there is 
a point of higher multiplicity, then the number of generators drops to | 8), Le., 
the description of K is simpler in this case. In both cases, these generators can be 
made explicit, see [54]/90\(247]. 

Further evidence supporting this claim is what we already know for the case 
when there is a moving line of degree 1, which was one of the first cases worked out 
by several authors [54/89\/TOT270). It turns out (cf. Corollary 2.2]) that y = 1 
if and only if the parametrization is proper (i.e., generically injective) and there is 
a point on the curve with multiplicity d— 1, which is the maximal multiplicity a 
point can have on a curve of degree d. In this case, K has d+ 1 minimal generators, 
of bidegrees 


{(0,d), (1,1), (,d—1), (2,d- 2),...,(d-1, 1)}, 


and one can show easily that the determinant of the 2x2 matrix formed by collecting 
the coefficients of the forms of bidegrees (1,1) and (1,d— 1) in this family as it 
was done in and (137), gives the implicit equation of the curve. So, for the 
case pt = 1 we can compute implicit equations with determinants of 2 x 2 matrices, 
independent of the value of d. 

In both cases (4 = 1 and ys = 2), explicit elements of a set of minimal generators 
of K can be given in terms of the input parametrization. But in general, very little 
is known about how many they are, what their bidegrees are, and how to compute 
them. This is a very active and promising are of research, with several results 
already obtained and yet more to be discovered in 
the following years... 

This topic will be further elaborated in Sections [3.2] and [3.3] 


Changing the Computational Model. The rigidity of Turing’s model of com- 
putation (which is still the standard to this day) combined with the fact that, to 
study the complexity of an algorithm in this model, we must consider all the possi- 
ble scenarios and most of the time take into account the worst case of each of them, 
sometimes leads to this unpleasant exponential number of steps that haunts elimi- 
nation theory. To overcome this problem, alternative models and algorithms have 
been proposed since the very beginning of the computer era. Several of them are 
still being explored and implemented, as they turn out to be very useful for prac- 
tical computations. We will go through some of them, highlighting a few pearls in 
elimination theory which have appeared in this century. 


Probabilistic algorithms. These algorithms make at some point “random” choices 
which have (most of the time) very high probability of success, and continue carry- 
ing on with their task. In this way, one can avoid a large number of operations and, 
with some luck, get to the right answer. An example of such an algorithm with 
strong implications in elimination theory was given by Gabriela Jeronimo, Teresa 
Krick, Juan Sabia, and Martin Sombra in [221]. Indeed, in the abstract of this 
article we read: 

We present a bounded probability algorithm for the computation of the Chow 

forms of the equidimensional components of an algebraic variety. In particular, 

this gives an alternative procedure for the effective equidimensional decompo- 

sition of the variety, since each equidimensional component is characterized by 
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its Chow form. The expected complexity of the algorithm is polynomial in the 

size and the geometric degree of the input equation system defining the variety. 

Hence it improves (or meets in some special cases) the complexity of all previous 

algorithms for computing Chow forms... 

... As an application, we obtain an algorithm to compute a subclass of sparse 

resultants, whose complexity is polynomial in the dimension and the volume of 

the input set of exponents. As another application, we derive an algorithm for 

the computation of the (unique) solution of a generic overdetermined polynomial 

equation system. 

Before describing roughly how this algorithm works, we will just say that the 
“Chow form” of an algebraic variety is a polynomial in a lot of variables which 
generalizes all the resultants described above. 

What follows a rough description of the algorithm. Note that no “general 
formula” or “determinantal” or “matrix form” appears in the different steps of this 
procedure, as in probabilistic algorithms one should use different shoes to walk 
through the resolution of a problem. 


Computing the Chow Form of a variety : 
Given (equations of) a d-dimensional variety V C P”, 


(1) Intersect V with d “general” (probabilistically chosen) hyperplanes to obtain 
finitely many points 


VO{M, =...Ha = 0} = {&1,...,€p} CP”. 


(2) From (a “geometric resolution” of) the set {&,...,€p}, lift via Newton’s 
method to the Chow Form of 
VA{H, =...= He-1}. 


Note that the latter set should be now one-dimensional. 
(3) Iterate this process d— 1 times to finally get the Chow Form of V. 


As explained in that paper, this algorithm is a probabilistic Blum-Shub-Smale ma- 
chine over Q which manipulates straight line programs. These machines are gener- 
alizations of classical Turing’s machines to the rationals in the sense that instead of 
manipulating only 0’s and 1’s, any number over Q is treated in the same way. As 
explained in the book [33], a BSS machine is a Random Access Machine with reg- 
isters that can store arbitrary numbers and can compute rational functions at unit 
cost. So, for this model, representing any number has a very cheap cost. There are 
several challenges in elimination theory within the formulation of BSS machines, as 
we will see soon. 

What are straight-line programs? These are very old friends of ours, although 
hard to imagine that they can have such a relevant role in modern computer al- 
gebra: a straight-line program is an algorithm (a “program”) with no branches, 
no loops, no conditionals, no statements, no comparisons, just a sequence of basic 
operations. A basic straight-line program is the evaluation of a polynomial with 
integer coefficients at a given point: one does not have to stop at any intermediate 
step to compare numbers or decide to change the formula if some prescribed value 
is reached or avoided. Of course the hard part in designing efficient straight-line 
programs is to get fast and useful procedures which solve a given task without bi- 
furcations. For instance, evaluating a polynomial at a given point the way we learnt 
in high school probably is not a good straight-line program, as using the Horner 
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scheme of evaluation is another of such straight-line programs which is better from 
a complexity point of view. 

Straight-line programs have a rich and interesting history in computational 
algebra and algorithms. The interested reader can learn more about them in [170 
[171/243]. 

Another highlight in this model of computation and using similar tools was 
given recently by Gabriela Jeronimo and Juan Sabia in [222], where they succeeded 
in producing a straight-line program which evaluates resultants in polynomial time. 
The abstract of their paper states: 

We prove that the sparse resultant, redefined by D’Andrea and Sombra and 

by Esterov as a power of the classical sparse resultant, can be evaluated in a 

number of steps which is polynomial in its degree, its number of variables and 

the size of the exponents of the monomials in the Laurent polynomials involved 

in its definition. Moreover, we design a probabilistic algorithm of this order 

of complexity to compute a straight-line program that evaluates it within this 

number of steps. 


To accomplish this task, the authors use very cleverly a geometric resolution of a 
toric variety jointly with the Newton Hensel method and Padé approximations. 


Parallelization. If the resolution of a system of polynomial equations becomes 
exponential because the number of roots to be computed is already exponential, 
there is some hope that by relaxing the problem and requiring just to compute 
“one root”, the number of steps may reduce. However, to choose one root from 
an algebraic system we must leave the algebraic representation that we are used 
to in algebra, and deal with the language and the tools that produce roots in 
the computational and numerical world. One of these powerful tools is Newton’s 
method, which lies at the heart of several probabilistic algorithms like the two we 
just mentioned above. Both in theory and in practice, Newton’s method is very 
fast and efficient provided that the Newton operator applied to the initial point 
converges, which seems to happen most of the time, but unfortunately it is not 
very easy to certify when Newton’s method will succeed at a given point. 

Inspired by this, we will say that a point z) € C is an approximate root of 
f(x) € C[a] if Newton’s method applied to zo converges to a root of f(x) in C. 
The analogue in n dimensions is straightforward: one needs a point in C”, and a 
square system (n polynomials in n unknowns) of polynomials so that the Newton 
operator is also defined, and approximate roots of this system will be those n-tuples 
in C” converging to an exact zero of the system after successive applications of the 
multivariate Newton operator. 

In 1998 Steve Smale ({323]) listed 18 problems that, in his opinion, would be 
challenges for the 21st century. Among them, problem number 17 asks for the 
existence of a deterministic algorithm for computing an approximate solution of 
such a square system of complex polynomials in time polynomial, on the average, 
in the size of the input system. Like most of the problems in that list, this one 
received a lot of attention and important progress was done towards its resolution 
in the first years of the century, see [23][24|[/51]. But it was Pierre Lairez ((249]) 
who finally tackled the problem and solved it in 2017. The abstract of his paper 
states: 

We describe a deterministic algorithm that computes an approximate root of n 

complex polynomial equations in n unknowns in average polynomial time with 
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respect to the size of the input, in the Blum-Shub-Smale model with square root. 
It rests upon a derandomization of an algorithm of Beltran and Pardo and gives 
a deterministic affirmative answer to Smale’s 17th problem. The main idea is to 
make use of the randomness contained in the input itself. 


Homotopy Methods. All the papers that dealt with Smale’s 17th problem mentioned 
above used this classical mathematical idea which turns out to be very fruitful from 
a computational point of view: if you want to solve a very complicated system f = 0, 
you can start with an “easy” system g = 0, and then “connect” both systems with 
an homotopy of the type tf + (1—t)g = 0, with t € C a new parameter, and then as 
t goes from 0 to 1 through some path over the complex numbers, then your “easy 
roots” will become the roots of the complicated system f = 0. Of course a lot of 
complications must be overcome here. For instance, what is a “right path” to take 
in the homotopy? Can we reach all the roots of the system f = 0 this way? Can 
we lose roots along the path? How can we do all these tasks with a computer and 
get the right answer? 

A lot of research has been done around these problems, with very important 
results achieved in the last decade, and a lot of challenges remain in front of us. 
These homotopy methods are at the basis of a new area called numerical algebraic 
geometry [194], which will be explained and illustrated in detail in Chapter Blof this 
book, so we are not going to add more on this topic. We only mention that with this 
naive idea of performing homotopies, several algebraic invariants of varieties can be 
computed with precision, like dimension, degree, number of irreducible components, 
and more. 

We close this selection of “tapas of the 21st century” with a very nice result 
obtained by Sandra Di Rocco, David Eklund, Chris Peterson, and Andrew Sommese 
in [125], where homotopy methods were used to compute the Chern numbers of 
a smooth algebraic variety. This should definitely convince you that numerical 
algebraic geometry has already established its place in this century, and lots of 
interesting and useful results are appearing and will continue to appear in this 
area. 


Conclusion. We have explored above a selection of tapas/results that should 
make you more like “feeling the flavor” (and not the highlights) of the state of the 
art in elimination theory in this century. Due to lack of space (and also personal 
knowledge), a lot of topics have not been covered here like differential/difference 
elimination theory, real elimination, direct applications of this area, or implemen- 
tations. We hope that with this sampling of results, you will be convinced that 
there are several interesting and exciting challenges in elimination theory for the 
following years to come. Salud! 


CHAPTER 2 


Numerical Algebraic Geometry 


In applications, we rarely have the luxury of knowing the exact solutions of 
a system of polynomial equations; approximations by floating point numbers are 
usually the best we can do. In fact, the coefficients of the polynomials are often 
floating point numbers themselves. Thus numerical issues become important when 
solving polynomial systems in the real world. 

The first section gives simple examples to illustrate some of the numerical 
complications that arise and explores the power of numerical linear algebra. In 
the second section we discuss the important technique of homotopy continuation 
and give some interesting examples. In the third section, Jonathan Hauenstein will 
discuss some recent applications of witness point sets. 


2.1. Numerical Issues and Linear Algebra 


For someone trained in classical algebraic geometry, working with floating point 
numbers can be an unsettling experience. We start with the binomial theorem. 
EXAMPLE 2.1. The binomial theorem gives the identities 
(a — 1)® = 2° — 62° + 15a* — 2027 + 152? — 62 +1 
= (((((@ — 6)a + 15)a — 20) + 15)a — 6)a +1. 
In algebra, evaluation is a ring homomorphism, so that evaluating these expressions 
at areal number gives the same answer. However, van Loan p. 44] notes that 


Algorithms that are equivalent mathematically may behave differently numerically. 


To see what this means, we convert the above identities into three MATHEMATICA 
functions, where N[...,16] means working with 16 digit floating point numbers: 
Flx_] = NEC - 1)76, 16] 
G[x_] N[x*6 - 6 x75 + 15 x74 - 20 x°3 + 15 x°2 -6x+1, 16] 
H[x_] = NE(((((x - 6) x + 15) x - 20) x + 15) x - 6) x + 1, 16] 
If we evaluate F, G, H at 200 equally spaced inputs in the interval [.995, 1.005] using 
16 digits of precision, we get the pictures: 


\_) Nan ead 
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Three algebraically equal expressions evaluate to very different graphs! The 
graph for G is the worst since it involves the most multiplications. But now redefine 
G to use 26 digits of precision: 

G[x_] = N[x*6 - 6 x75 + 15 x°4 - 20 x73 + 15 x°2 -6x+1, 26] 
When we plot values at 200 equally spaced inputs in the interval [.995, 1.005], we 


get the graph: 


To the human eye, this is Sidiaincdenents from the graph of (a — 1)°®. 


< 


In numerical algebraic geometry, it is often necessary to increase the precision 
automatically during certain parts of a computation. This is the idea of adaptive 
precision that we will encounter in Section [2.2] 

Our second example is taken from Ideals, Varieties and Algorithms [104]. 


EXAMPLE 2.2. To find the minimum and maximum values of the function 
x® + 2Qxyz — z* on the unit sphere x? + y? + z? = 1 in R%, we use Lagrange 
multipliers to get a system of equations in A, x, y, 2: 

3a? + Qyz = r+ Qa 
2uz=X-2y 

Qty —2z=A-2z 
ety te27 = 1, 


(2.1) 


A lex Grébner basis with \ > a > y > z is given by the eight polynomials 


3 3 167616 , 36717 , 134419 , 
gn =A £ YZ ze Zz z 
n° 9 3835 590 7670 
gar+y+z-1 
— ay — 195845 , 1999 5 6403 
93 = 80 ~ 3835 295 3835 
=e 4 yg? _ 52,5 108 5 , 2556 | 
—— ¥? ~ 38357 — 205° 3835 
9216 . 906. 2562 
fe 2 5 3 
= = 
95 = Y FU —Y~ e956” + 5957 — 39357 
>. 6912, 827, 3839 
G6 =Y 2 z 4 z z 
3835 295° 3835 
576. 1605 , 453 
_ 3 64 4 2 
eae eg te 
1 ral 4 rT 
pat Ey HI Ne yea (a By 
1152° 1152” 288 9 128 


The final polynomial gg has roots z = 0, +1, +2/3, +V/11/(8V2), all real and exact. 


As in [104] Example 3 of Chapter 2, §8 


solutions of (2.1), again all real and exact. 


, one can work backwards to find the 10 
Evaluating x? + 2ryz — z? at these 


solutions makes it easy to determine the desired extreme values. 
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This example is lovely but does not represent what happens in most applica- 
tions. To get a sense of what’s involved, let’s return to (2.1) and change the middle 
coefficient of 23 + 2xryz— 2? from 2 to nearby values such as 2.1 or 2.01. So consider 
x® + cxyz — 27, where c is a parameter. Finding the extrema of this function on 
x? +y? + 27 =1 leads to the system of equations 

3a? + cyz =A- 2a 
cuz = X- 2y 


22) cry —2z=- 22 


gety? +27 =1. 
For c = 2.1 = 21/10 and c = 2.01 = 201/100, a lex Grdbner basis with 
A\>a«a>y > z still consists of eight polynomials, where gg = gg(z) eliminates all 
variables but z. A surprise is that gg has degree 9 for these new values of c, while gg 


had degree 7 for c = 2. The following table shows the roots of gg for c = 2, 2.1, 2.01 
to four decimal places: 


c first seven roots of gg two new roots 
2 |/0 +1 +2/3 +/11/(8V2) 

=+.6667 =+.2932 
21/0 +1 +.6592 +.3139 +14.9447,/-1 
2.01}0 +1 +.6659 +.2957 +142.2176,/—1 


As c > 2, the first seven roots approach the exact values found when c = 2, while 
the two new roots approach oo (2.01 is ten times closer to 2 than 2.1, and the new 
roots are correspondingly ten times larger). 

For the whole system (2.2), there are 12 solutions when c = 2.1 or 2.01. Two 
of these go to oo as c — 2, leaving 10 finite solutions when c = 2. This analysis 
reveals that the original system (2.1) has solutions at oo. <P 


Numerical Algebraic Geometry. Examples2J)and[2.2lillustrate the challenges 
that arise when we combine polynomials and floating point numbers. The difficulty 
of doing algorithmic computations with real and complex numbers has been around 
for a long time. Five hundred years ago, Leonardo da Vinci observed that 


Arithmetic is a computational science in its calculation, with true and perfect 
units, but it is of no avail in dealing with continuous quantity. p. 201] 


Numerical analysis was invented to address this difficulty, and applied to systems 
of polynomial equations, the result is numerical algebraic geometry. 
When dealing with polynomials over R or C, there are several paradigms for 
how to proceed. Here are brief comments about two, as applied to the system (2.2): 
e In [329], Stetter uses the language of empirical coefficient, where “c = 2 
within +0.1” means that |c—2| < 0.1 is likely, 0.1 < |c—2| < 0.2 is possible, 
and |c — 2| > 1 is unlikely. 
e In [324], Sommese and Wampler regard c as a parameter with a generic 
behavior (12 solutions). Then homotopy continuation is used for a specific 
value of the parameter such as c = 2 (10 solutions). 


We will explore the Sommese and Wampler approach in Section22.2]when we discuss 
homotopy continuation. 
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We should also mention the powerful framework created by Blum, Shub and 
Smale. Their methods involve a-theory and what are now called BSS machines. 
See for the full story and [196}/249] for applications to algebraic geometry. 


Linear Algebra and Algebraic Geometry. It is time to do some algebraic 
geometry in a numerical context. Here, we will focus on the interaction between 
linear algebra and algebraic geometry. In Section|2.2] we will turn our attention to 
homotopy continuation. 

The basic idea is that when a polynomial system has finitely many solutions 
over C, basic tools from linear algebra (bases, eigenvalues, eigenvectors) tell us a 
lot about the solutions. We first review the basic theory and then discuss what 
happens numerically. The starting point is the following classic result. 


THEOREM 2.3 (Finiteness Theorem). Let I C C[x1,...,2n] be an ideal. Then 
the following are equivalent: 
(1) V(Z) CC” is finite. 
(2) dim C[a1,...,a]/I < oo. 
(3) I has a Grobner basis G such that for alli, there is g € G with LT(g) =a;"'. 
(4) Every reduced Grobner basis of I has the above property. 


Furthermore, when this happens, the cosets of the monomials 
B= {x* =a7---ay” | x ¢ (LT(I)) = (LT(G))} 
form a basis of C[a1,...,Up]/I (necessarily finite). 


An ideal I as above is zero-dimensional since V(J) consists of finitely many 
points and hence has dimension zero as a variety. The monomials «°® € B are called 
remainder monomials. The set B has two properties: 


e Bis an order ideal: if * € B, then any monomial dividing x is also in B. 
e Bis connected to 1: if x° € B, then either x® = 1 or 7° = a,x? for some 2; 
and some «° € B. 


The upshot is that once we have a zero-dimensional ideal I C C[{a1,...,2n], we get 
the finite-dimensional vector space C[a1,...,@,]/Z and a nice monomial basis B. 
We also have some interesting linear maps defined on this vector space. 


Eigenvalues of Multiplication Maps. Since C[21,...,2,]/I is a C-algebra, a 
polynomial f € C[a1,...,@n] gives the linear map 


my: Clati,...,%n]/2 — Clni,...,¢n]/T 
defined by my([g]) = [fg]. In 1981, Lazard proved the following result. 


THEOREM 2.4 (Eigenvalue Theorem). Assume dimC[#1,...,2n|]/I < co and 
f €Clai,...,an]. Then 


{eigenvalues of ms} = {f(p) | p€ VD}. 


When we set f = 2;, it follows that the eigenvalues of the linear map mz, are 
the ith coordinates of the solutions. So we have reduced solving equations to an 
eigenvalue problem! Here is a simple example. 
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EXAMPLE 2.5. The polynomial system 
fi = 2? + 2y? — 27 =0 
fo = ay” — zy =0 
fg =y? —2y? +y =0 


has Groébner basis { fi, fo, fg} with underlined leading terms for lex order with 
xz > y. Now apply the Finiteness Theorem: 


l| 


e There are finitely many solutions since LT(f,) = x”, LT(f3) = y?. 
e The remainder monomials form the order ideal B = {1,y, y?, 2, xy}. 
e Cix,y|/( fi, fe, fz) has dimension 5. 


The matrices of m, and my relative to B = {1,y,y,z, xy} are 


0 0 0 O 0 0 0 0 0 0 
0 0 0 2 2 1 0 -1 0 O 
Mz=|0 0 0 -2 —-2 M,= |0 1 2 0 0 
1 0 0 0 0 0 0 0 0 0 
0 1 1 0 O 0 0 oO 1 1 


The first two columns of M, follow since m, maps the basis elements 1,y to the 
basis elements x, xy. For the third column, fz = xy? — ry implies 
me([y7]) = [xy] = [xy]. 

The other entries are also easy to find. Let’s compute eigenvalues: 

e CharPoly y,, (u) = —u? implies that 0 is the only eigenvalue of mz. 

¢ CharPoly jy, (u) = —u?(u —1)° implies that 0,1 are the eigenvalues of my. 
By the Eigenvalue Theorem, all solutions have x = 0 and y = 0,1. Hence the 
solutions are (0,0), (0,1), found by linear algebra! < 


The Eigenvalue Theorem can be modified to take multiplicities into account. 
Suppose I = (fi,..., fs), so that p € V(Z) has a multiplicity m/(p, J) as a solution 


of the polynomial system f; = --- = fs = 0 (see Definition 4.2.1]). Then 
Theorem can be stated as 
(2.3) CharPoly y,, (u) = II Fp au)yee 

pevi(l) 


(see Proposition 4.2.9]). When f takes distinct values at the solutions, (2.3) 
implies that m(p,J) is the multiplicity of f(p) as a root of CharPoly,,,. We can 
see this in Example where the solutions have distinct y-coordinates. Hence 
CharPoly y;, (u) = —u?(u — 1) tells us that (0,0) has multiplicity 2 and (0,1) has 
multiplicity 3 as solutions of fy = fo = fz = 0. 


Eigenvectors of Multiplication Maps. Theorem [2.4] gives the coordinates of 
the solutions but doesn’t explain how to match them. One way is to observe that 


Mq,Me; = Me;Mz, for all 1,7 since C[x1,...,@,]/I is commutative. We know that 
simultaneous eigenvectors exist in this situation, and for a simultaneous eigenvector 
v, the corresponding eigenvalues mz,(v) = piv give the solution p = (p1,...,Dn). 
In general, 


{my | fe Cleis.se)} 
is a commuting family of linear maps on C[a21,...,2,]/Z. This leads to a rich 
algebraic structure explored in [241]. 
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Another approach is to use the dual map 
me: (Clz1,+.+,2n)/2)” —> (Clai,...52n)/2)": 
If My is the matrix of my with respect to the basis B, then 
eigenvectors of my <—> eigenvectors of M; +—» left eigenvectors of My 


since the transpose MF is the matrix of my with respect to the dual basis of B. 
In 1988, Auzinger and Stetter showed that a solution p € V(J) gives a left 
eigenvector of My as follows. 


THEOREM 2.6 (Eigenvector Theorem). Assume dimC[x,...,@n]/I < oo with 
monomial basis B= {¢%?,...58°"}, and for f © Clai,...4%,), let My be the 
matrix of mz with respect to B. Then: 

(1) Every p € V(1) satisfies 
(p™,...,p°’)My = f(p)(p™,...,p’), where p™ = x“ (p). 
(2) If f takes distinct values on V(I) and My is non-derogatory, then (1) gives all 
eigenvectors of My up to scalar multiplication. 


Recall that a matrix My is non-derogatory if its minimal polynomial equals 
its characteristic polynomial, or equivalently, each eigenvalue has a unique Jordan 
block in the Jordan canonical form of My. 


EXAMPLE 2.7. Consider the polynomial system from Example[2.5] where B = 
{1,y,y?,x, xy} gives a basis of C[z,y]/(f1, fo, fs). One can check that M, and 
M, are derogatory, but Mo,+43, is non-derogatory and has eigenvalues 0,3 with 
corresponding left eigenvectors 

1, y, y?, @, zy a, 

1 1 1 

0: (1, 0, 0, O, 0) ; (0, 0) 
(LL 1 0, ae so the solutions are ‘0 1) by Theorem [2.6] 


Sometimes My is always derogatory—see [97|/241] for the algebra and geometry 
that govern whether or not non-derogatory multiplication maps exist. <— 


Numerical Linear Algebra. It is now clear that linear algebra can help solve 
zero-dimensional polynomial systems. But Examples [2.5] and 2.7] have very simple 
eigenvalues. In applications, the eigenvalues tend to be more complicated, which is 
where numerical linear algebra enters the picture. 


For a vector u € C”, we use the standard 2-norm ||u||2 = 4/7}, |wi|?.. Then 
the 2-norm of an n x n complex matrix A is 


||All2 = max ||Aulle, 
lPull2=2 


which records the maximum “stretch” of A. When A is invertible, we define the 
2-condition number of A to be 


cond,(A) = ||A~* 2 - || Alle. 
It is easy to see that condg(A) > 1, with equality when A is unitary. The intuition 
is that when cond2(A) * 10%, one loses roughly a digits of accuracy when solving 


the linear system AZ = b. Thus computing accurate solutions gets more expensive 
as cond2(A) increases. 
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EXAMPLE 2.8. The 5 x 5 Hilbert and Pascal matrices are 


pian 2 La th 4 
5 eee es 
2 3 4 5 6 123 4 5 
H,;=|1 111 11, P=|1 3 6 10 15 
3 4 5 6 7 
a 1 4 10 20 35 
pati 1 5 15 35 70 
5 6 OF CB. CG 


These matrices have determinants det(H;) = 3.7493 x 10~ and det(Ps) = 1. Since 
Cramer’s rule has the determinant in the denominator, you might expect H; 7 = b 
to be harder to solve numerically than P;z = b. This is true, but 


cond2(Hs) + 476607.6, cond2(Ps5) ¥ 8517.5 
implies that we lose 5 decimal digits of accuracy using Hs and 3 using P;. <- 


You can read more about condition numbers, including their relation to the 
singular value decomposition (SVD), in Section 2.6]. 


Generic Multiplication Matrices. Since the multiplication matrices M;,, can 
be used to solve zero-dimensional polynomial systems, it makes sense to ask how 
hard they are to compute accurately. We will explore a particular case based on the 
2018 paper A Stabilized Normal Form Algorithm for Generic Systems of Polynomial 
Equations by Telen and Van Barel [343]. 

When f,g € C[x, y] are generic of degree d, some very nice things happen: 


e B= {xy |0< a,b <d-—1} isa basis of C[z, y|/(f,g). Note that B is an 
order ideal with |B| = d?. 

e The system f = g = 0 has d? solutions in C? by Bézout’s Theorem. 

e There are formulas for the multiplication matrices M,, M,. 


In Example 2.5) we used a Grobner basis to find the multiplication matrices. But 
here, no Grobner basis is needed—we can compute M, and M, directly from f, g. 
The full story is explained in Section 3.6], so we will instead just give the 
basic idea of how this is done. 

The first step is to create a Macaulay matrix M similar to the ones encountered 
in Chapter [I] We express the products xy! f,x'y/g for i+ j < d—1 in terms of 
monomials of degree 2d — 1, where columns corresponding to monomials of B are 
on the right. Here is an example of M with d= 2 taken from [B43]. 


EXAMPLE 2.9. Consider the system: 
faott+y?-2, g=3827-77% -2. 


Here, B = {1,x,y,xy}, and when we express f,af,yf,g,2g,yg in terms of the 
monomials of degree 3, with 1,x,y, xy, we get the matrix (omitting zero entries) 


a xy xy? y? x? y? 1 x y “Ly 
f 1 1|-2 
, _ of 1 1 —2 
M=(M,| Mj] =“ 3-1] -2 
xg 3 —1 —2 
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Since M, is an invertible 6 x 6 matrix, we can multiply each side by M;' to obtain 


x3 xy xy” y?> x? y? ig z. y “Ly 
ea 1 —1 
ayy 1 —1 
= xy? 2 1 —1 
(Ie | My *M2) = a 1 “4 : 
e—-1 1 —1 
yal t | =1 


where each row label is the polynomial determined by the coefficients of the row. 
The underlined monomials will be explained shortly. 

Multiplying by M;* corresponds to a series of row operations on (M, | M2), 
which means that the above row labels are elements of the ideal (f,g). In particular, 
the polynomials z?y — y, ry? — x, 27 — 1, y” — 1 are in (f,g). 

The columns in the above matrices are labeled by monomials of degree < 3. 
Plotting their exponent vectors gives the picture 


(0) 


@e 
B 
ee [e) 


where B is shown in black and the underlined monomials give the boxes. Note 
that the boxes line on the border of B—they are border monomials, and the four 
polynomials xy — y, ry? — x, x2? — 1,y? —1 form a border basis of (f,g). You can 
read more about border bases in [232]. 

Once we have the border basis, the multiplication matrices 


el ae wy way yl oye yy yay 

1 0 1 O 0 1 0 oO 1 0 

x 1 0 O 0 x 0 O O 1 

Be y 0 O O 1 ae y 1 OO 0 0 
ry 0 oO 1 0 xy 0 1 0 0 


are easy to find. When we multiply B by x, we get either an element of B or a 
border monomial. For the former, the column of MM, is obvious, and for the latter, 
the border basis gives the column. The same thing happens with M,, so you can 
see the close link between border bases and the multiplication matrices. << 


The same happens for generic f,g of degree d. In this case, the Macaulay 
matrix M = (M,| Mp) has size 2(*5') x 41") = d(d+1) x d(2d+1), where M, is 
d(d+1) x d(d+1) and invertible. The process described in Example2.9] generalizes, 
where appropriate rows of (q(a41) | My ' Mz) give a border basis, from which M, 
and M, are easily computed. 

The upshot is that we have a lovely method to compute M, and M,. But 
while the theory is wonderful, the reality is that the above process can be ill- 
conditioned numerically. Looking back at Example you can see that the key 
step is multiplying by M4, ' Hence, in order to know accurately we have computed 
M, and M,, we need to know the condition number cond2(M;). 
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In [343], Telen and Van Barel compute the condition number cond2(M,) for 
random f,g of degree d with 1 < d < 20: 


cond, (M, ) 
107° 


101° fe =I 


0 | | l 
” 5 10 15 20 


degree d 
These are poorly conditioned matrices! 

This is where numerical linear algebra comes to the rescue. The hope is that 
switching from B to a different basis B’ of C[x, y]/(f,g) will lead to smaller condi- 
tion numbers. The paper shows how to find a monomial basis B’ with this 
property. Their strategy is to do numerical linear algebra on M = (Mj |Mz2) before 
inverting M, by using QR factorization with optimal column pivots. Since columns 
correspond to monomials, switching columns gives a different monomial basis B’. 

The full details are given in Algorithm 1]. Here is an example with d = 10 
that shows the original monomial basis B and the new basis B’: 


monomials in B monomials in B’ 


Sggesssess 


On the left, we see the nice order ideal B, while the monomials in B’ are more 
scattered—neither an order ideal nor connected to 1. But the condition numbers 
are much better: 


cond2(M;) 
107° ae : 
< condition numbers using B 
1919 = | 
+ condition numbers using B’ 
10° | | | 


5 10 15 20 
degree d 
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It is clear that from a numerical point of view, one wants to compute multiplication 
matrices using B’, not B. 

Another way to measure the improvement is to compute residuals. This works 
as follows. Simultaneously diagonalize M,, M, to get approximate solutions (2;, y;), 
and normalize f and g so that the vector norm of their coefficients is 1. Then the 
residual is the maximum of $(|f (ai, yi)|+|9(xi, ys)|) over all solutions. Here, smaller 
residuals are better. 

In [343], residuals are computed for random choices of f,g with 1 < d < 20, 
using both B and B’. The improvement is dramatic: 


residual 


| < residuals using B 


< residuals using B’ 


degree d 


The paper focuses on the dense case, while its follow-up by Telen, 
Mourrain and Van Barel extends the analysis to sparse polynomials. 


Algebraic Oil. Our final example involves a collaboration between Shell Interna- 
tional and the Universities of Genoa and Passau. The goal is to model in a well 
using algebraic geometry and numerical linear algebra. Papers written on this sub- 
ject include and [242]. The authors of are Kreuzer (Passau), Poulisse 
(Shell) and Robbiano (Genoa). 

For concreteness, we will focus on an example from by Baciu and Kreuzer 
that involves the two-zone well shown here: 


(2.5) 
LT 
valve position zone A 


2 


valve position zone B 
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Although the picture looks nice, we actually know little about the underground 
geometry beyond the fact that there are two zones. So it’s not possible to build 
a model of oil production in advance—one needs to create the model based on 
observations. 

In the picture (2:5), x1 and x2 record the settings of the valves at A and B. 
Altogether, there are seven variables in this model: 


x1 = v4 = valve opening at A 

L2 = Vg = valve opening at B 

x3 = pa = pressure drop across valve A 
(2.6) t4 = pp = pressure drop across valve B 

L5 = PAB = pressure loss between A and B 

v6 = Ptr = pressure loss A to the surface 


L7 = Pgas = gas production (related to another pressure) 


We want to find a set of monomials in 7),...,27 such that 
(2:7) oil production % cya" +--+ +c." 
for constants c,,...,¢;. We like monomials because they can represent interactions 


between variables. For example, 7124 = v,app measures how the setting of value A 
mediates the influence on oil production of the pressure drop across valve B. 

In this setting, a data point consists of (p,P) € IR’ x R, where p records the 
values of 41,...,27 and P € R is the corresponding oil production for these values. 
There are 5500 data points, giving 


X = {pi,.-., P5500} C R’ 
and oil production 
P = (Pi,..., P5500) € R™. 
A Naive Model. For the moment, assume that our data is exact and that we can 


do exact computations. The vanishing ideal of ¥ is 
(Xv) = {f € Ria,...,27] | f(pi) = 0, 7 =1,..., 5500}. 


Then evaluation at ¥ = {p1,...,ps5500} induces an R-algebra isomorphism 
Rlay,..., 27] /1(¥) 22s R550, 
If B is a monomial basis of R[x1,...,27]/I(4), then evaly(B) is a basis of R°°??. 
Expressing the production data P = (P;,...,Pss00) in terms of this basis gives 
P= x Caevaly (x). 
z*EB 


Then the polynomial F = )> «cp CaX™ satisfies perfectly. 

From the point of view of interpolation, this works nicely since F(p;) = P; 
for all 7. But from a modeling perspective, it is terrible, since the model is overly 
complex (5500 monomials!) and suffers from the classic problem of over-fitting. In 
general, one wants to avoid situations where “the model is the data.” 

A better way to proceed is to pick a subset By C B so that Bo is reasonably 
small and P € R°°° is reasonably close to the subspace Span(evaly (Bo)) C R°°?. 
Let Po denote the orthogonal projection of P onto this subspace. Then writing 
Po = ViraeR, Cabvaly (x) gives Fo = Docc p, Cat” that satisfies (2.7). 
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But how to do pick Bo? The first step is to understand where the monomial 
basis B comes from. 


The Buchberger-Moller Algorithm. Since the evaluation map 


evaly : R[w1,...,2%7] — R09 
is onto, we find a monomial basis of R[a1,...,x27]/I(4’) by listing all monomials 
v1, ¢,..., successively applying evaly, and discarding those whose images are 


linearly dependent on the images of the previous ones. Once we have found 5500 
monomials, we have a monomial basis B. 

A better approach is due to Buchberger and Moller in 1982. Monomials 
are ordered using a monomial order, and keeping track of linear relations coming 
from discarded monomials leads to a Grébner basis G for I(4) such that B is the 
order ideal consisting of all remainder monomials for division by G. A description 
of how this works for lex order can be found in Section 2.3]. 

The numerical instability of the Buchberger-Moller algorithm is well known. 
Here is a simple example, 


EXAMPLE 2.10. Let X = {p1, p2,p3} = {(—1, 0), (0,c), (1,0)} C R?, where c is 
a parameter, possibly 0. 


P1 Pa D3 


Using graded lex with x > y, the Buchberger-MGller algorithm gives the following 
reduced Grébner bases G for I(4’) and monomial bases B for R[x, y]/I(4): 
c=0:Go= {x — x,y}, Bo = ia} 
c#0:Ge= {a -1- ty, cy, y? —cy}, Be= {1,y,}. 
When c is large, G. and B, are what we want. But when c is not 0 but really close 
to 0, then we would prefer Gp and Bop. One way to achieve this is to replace “exactly 
linearly dependent” with “approximately linearly dependent” when considering the 
evaluation of a new monomial. 

To see what this means, specify a tolerance ¢ for approximate linear dependence, 
and suppose 0 < c < e. Listed in increasing order for graded lex with x > y, the 
monomials are 1,y,2,y?, ry, 2°,.... When we evaluate these at 4 and consider 
linear dependence, we get the following table: 
evaly (x*) Bexact Bapprox 


1 (1,1,1) | {1} {1} 

y | (0,c,0) | {ly} | {1} 

x | (-1,0,1) | {1l,y,a} | {1,2} 

y | {0,e7,0) | {1,92} | {1a} 

xy | (0,0,0) | {1,y,a} | {1,2} 

z* | (1,0,1) | {l,y,2} | {1,2,27} 
The line for y is revealing, since (0,c,0) and (1,1,1) are linearly independent but 
e-linearly dependent. This explains why y is added to Bexact but not Bapprox. The 
other entries in the table are similar. Since Bapprox = Bo, we are happy. << 


In the above example, we did not say exactly what we mean by e-linearly de- 
pendent. Making this precise takes careful thought and can be done in several 
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ways. The paper [199] introduces the Approximate Vanishing Ideal algorithm, and 
the 2014 PhD thesis of Limbeck studies this algorithm in detail and describes 
a variant called the Approximate Buchberger-Moller algorithm. For expository pur- 
poses, we will say a few words about the latter. 


The Approximate Buchberger-Moller Algorithm. We begin with a modified 
Buchberger-Moller algorithm that focuses on the monomial basis B. Given points 
X = {pi,...,ps} CR”, we have an isomorphism 


Riay,...,2%n]/1(X) 22 RB. 
Pick a graded monomial order < on R[21,..., 2] and list monomials in increasing 


order according to <. We add monomials one at a time to B, starting with B = {1}. 
After processing the monomials of degree d, consider the monomials 


(2.8) L = {monomials of degree d+ 1} M (a1BU--+-Ua,B). 


At this point, B consists of monomials of degree < d, so the important monomials 
of degree d+ 1 to consider are those on the border of B. This is the set DL defined 
above. We work through the elements of LZ according to <. For each #* € L, we 
compute evaly (2°). Then: 


e If evaly (x) U evaly(x°)(B) is linearly dependent, then discard x*. 
e Otherwise, add x to B. 


Once all monomials in L have been processed, we move to the next degree and 
continue until L = 0. By Algorithm 22 with ¢ = 0 and Theorem 4.3.1], the 
algorithm terminates. This is the exact algorithm, which has two nice properties: 


e The final value of B is an order ideal basis of R{#1,...,%,]/1(%). 
e By keeping track of the linear dependencies of discarded monomials, one can 
obtain a border basis for I(%). 


For the approximate version of this algorithm, we follow Algorithm 22]. 
Fix a threshold « > 0. As in the exact algorithm, suppose that we are deciding 
whether to add «® to B. Consider the (1+ |B]) x s matrix A whose first column 
is evaly (x) and remaining columns are evaly(x*) for 2° € B in decreasing order 
according to >. (We will give an example of this below.) Then A’ A is a symmetric 
semi-definite matrix whose entries the dot products of the columns of A. Its smallest 
eigenvalue \ has the property that 


(2.9) VX = minyyy=1 ||Avlla- 


There are two methods to find \: compute a SVD of A”A or solve the above 
minimization problem by numerical methods. Now we decide what happens to «® 
as follows: 


elf VA\K< e, then discard x@. 
e Otherwise, add x® to B. 


Once the monomials of degree d are processed in this way, we define L as in (2.8) 
and continue until L = @. We call this the e-algorithm. By Theorem 4.3.1]: 


e B is connected to 1, though it may fail to be an order ideal. 

e If B is an order ideal, then when VX < €, eigenvectors for \ serve the role of 
é-linear dependencies, and from these, one can obtain a set G of polynomials 
that generate an approximate vanishing ideal of X. 
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The full theory of these concepts is explained in and [257]; a nice exposition 
appears in [242]. For an algebraist, an approximate vanishing ideal is troubling 
since its generators usually generate the unit ideal in the standard sense. 

Ignoring these complications, there is one elementary feature of the c-algorithm 
we need to mention. By (2.9), 


A =0 <=> columns of A are linearly dependent 
<> evaly (x) Uevaly(x*)(B) is linearly dependent. 


When \ = 0, we have VX = 0 < €, so any monomial discarded by the exact version 
of the algorithm will also be discarded by the e-algorithm. However, the condition 
VX < € gives us leeway to discard more monomials. The result is that the set B 
produced by the e-algorithm may fail to be a basis of R[ay,...,¢n]/I(4). 

Here is an example due to Limbeck Example 4.3.8]. 


EXAMPLE 2.11. Let X = {(0,0), (0,1), (1,0), (1, 1), (.5, .5)} C R? consist of the 
vertices of the unit square and its center. We will use graded lex order with x > y 
and tolerance ¢« = 0.2. The ¢-algorithm starts with B = {1} and successively adds 
x and then y to obtain B = {y,x,1}. At this point, L is defined to be {x?, ry, y7}, 
and to decide about 2”, the matrix A is 


0 0 0 1 
0 1 0 1 
A=|]1 011 
1 g£ 2 4 
25.5 .5 1 


The smallest eigenvalue of A7A is \ = .0246396, and V/\ = .15697 < «. Hence we 
discard x”. When we turn to ry, we get a new first column, so A becomes 


B 0 0 1 
0 101 
A=|0 011 
1 111 
E 5 5 1 


The smallest eigenvalue of A7A is now \ = .145979, and VX = .38207 > «. Thus 
we add xy to obtain B = {xy, y, x, 1}. 

One can check that y? is also discarded, so we are done with degree 2. Then 
we define L = {x?y, ry”} (the degree 3 elements of xB U yB), and both of these 
are discarded as well. So B is still {ay,y,x,1}. It follows that when we increase 
the degree to 4, we get L = @ and terminate. 

There are a lot of interesting things going on here. The set B = {xy,y,x,1} 
has four elements, yet dim R[x, y]/I(¥) = 5 since we have five points. So B is not 
a basis of R[x, y]/I(4). Also, the four discarded monomials are x”, y?,x?y, xy’. 
Using eigenvectors of \ to represent ¢-linear dependence, one gets polynomials 


g. = —.697x? + .715a — .044 

go = .685y? + .041ary — .724y — 0212 + .054 

93 = —.698x7y + .715ay — .008y — .008x2 — .013 
ga = —-698xy? + .7152y — .008y — .008a — .013 


that generate the approximate vanishing ideal of V. 


(2.10) 
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The polynomials g1, 92, 93, 94 generate the unit ideal in R[x, y], and if one asks 
a program such as MATHEMATICA to solve gi = g2 = g3 = g4 = 0, one gets the 
empty set. To get a better sense of how the g; relate to the five points in 7, here 
is a picture that shows ¥ in red along with a contour plot of |g:|+|g2|+|g3|+ |gal: 


14 11 L l L 
0.0 0.2 0.4 0.6 0.8 1.0 


You can see that the g; are not very small at the central red point but get close to 
zero in the dark blue areas. So we lose the center of the square, but its influence is 
felt since the dark blue areas have been pulled towards the center. 

This example makes it clear that the approximate world requires us to think 
differently about basic objects in algebra and geometry. << 


One exciting feature of the ¢-algorithm is that it can produce a set B with fewer 
monomials than the number of points in V. In a sense, the algorithm is saying that 
because of the tolerance ¢, fewer monomials may be required. 

In Example 2.11] we started with five points but ended up with only four 
monomials. In some cases, the reduction is more dramatic, as we will see when we 
apply these ideas to the oil production model discussed earlier in the section. 


Back to the Oil Well. First observe that when the valve settings 71,22 in (2.5) 
are both zero, there is no oil production. Hence the oil production polynomial we 
seek should lie in the ideal (21, 72) C R[x, v2]. This restriction leads to the subideal 
Buchberger-Moller algorithm, which computes a monomial basis of 


(21,2) /(1(4X) M (@1,22)) —_ R[a,, oa ,7|/1(X). 


This algorithm and its approximate variants are developed in [240]. 
In [£3], an approximate subideal algorithm is applied to the 5500 data points 
X = {pi,..-,pss00} C R’. The result is a set Bo consisting of 


31 monomials 


in the variables 71,...,27 from (2.6). When we orthogonally project the oil pro- 
duction data P = {P,,..., P5500} € R°°°° onto the span of evaly(Bo), we get a 
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polynomial with 31 terms 


(2.11) Fo= Do ca 


aE Bo 


that models oil production. The precise formula for Fo is given in [13], along with 
a discussion of how well the model fits the data. 

An important observation is that in this model, the pressure variables 3, ..., 27 
in serve as proxies for basic physical quantities such as kinetic energy, work, 
and dissipation that cannot be measured directly. Unfortunately, many of the 31 
monomials in the model don’t make sense physically, and the same is true for the 
polynomials 91, 92,93, 94. Creating an algorithm that respects the physics without 
knowing it in advance is a continuing challenge. 

A different analysis of this example that focuses on the five pressure variables 
can be found in [242]. 


Final Comments. In this section, we explored two major examples that used a 
variety of numerical methods from linear algebra: 


e To get well-conditioned multiplication matrices for generic polynomials in 
C[z, y] of degree d, the paper used QR. factorization with column pivots. 
In (2.4), the nice order ideal B was replaced with the more scattered B’ that 
gave multiplication matrices with much better condition numbers. 

e To model oil production, the paper used an €-approximate version of 
linear dependence to create a small set of monomials that gave the desired 
model (2.11). But as we saw in Example 2.11] these methods replace the 
vanishing ideal with polynomials that usually generate the unit ideal. 


In both examples, we worked in a finite-dimensional quotient of a polynomial ring 
by an ideal. In the first example, we knew the ideal, but to get the solutions 
via multiplication matrices, we had to abandon working with an order ideal of 
monomials. In the second example, we had the solutions (the data points), but to 
get a reasonable set of monomials, we had to abandon the usual notion of ideal. 

The moral is that while numerical linear algebra has a lot to offer algebraic 
geometry, one needs to approach the whole enterprise with an open mind and a 
willingness to think outside the box. 


2.2. Homotopy Continuation and Applications 


The previous section harnessed the power of numerical linear algebra to help 
study systems of polynomial equations. In this section, we will tap into the equally 
rich collection of numerical techniques used to solve ordinary differential equations. 
These techniques are relevant to algebraic geometry because homotopy continuation 
enables us to solve a polynomial system by tracking solutions of a suitable system 
of ODEs. 

The plan is to explain how homotopy continuation works and then explore four 
substantial examples. 


The Idea of Homotopy Continuation. Suppose that we want to compute all 
complex solutions of a polynomial system f(z) =-+: = fn(z) = 0 for f; € C[z] = 
Clz1,...,2n]. Note that the number of equations matches the number of variables. 
For the time being, we will assume that there are finitely many solutions. We also 
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write the variables as z = (z1,...,2n) to emphasize that we are working over the 
complex numbers. 
In homotopy continuation, we have the target system F(z) = (fi(z),.--, fnr(z)) 


together with a start system G(z) = (gi(z),---,9n(z)) whose solutions we know. 
We get from one system to the other via the homotopy 
(2.12) A(z,t) =tG(z)+(1-t)F(z), te [0,1]. 


Thus H(z, 1) is the start system G(z) and H(z,0) is the target system F(z). The 
target system is t = 0 since there are more floating point numbers near 0. 

The basic idea is that as we decrease t from 1 to 0, H(z,t) deforms the start 
system G(z) = 0 into the target system F(z) = 0. In the process, we hope that 
solutions of G(z) = 0 will deform into solutions of F(z) = 0. 

To formulate this precisely, let z) be a solution of G(z) = 0 and assume that 
z(t) : [0,1] > C” is a differentiable function satisfying z(1) = zo. By the chain rule, 


d “OH ; OH 
qite),t) = bP ce (z(4), t) z(t) + ay (2): é) 


i=1 


Hence the following are equivalent: 
e H(z(t),t) =0 for all t and z(1) = 2. 
e z(t) satisfies the initial value problem (IVP) 


(2.13) YH we. NAw + Fe.) =0, 20) = 20. 


A solution z(t) of (2.13) is called path. If the path is defined on all of [0,1], then 
z(t) deforms the solution z(1) = zo of the start system G(z) = 0 into a solution 
z(0) of the target system F(z) = 0, exactly as we had hoped. 

However, not all solutions of the start system deform so nicely. In fact, there are 
many possible behaviors for a given path. Let’s begin with three good behaviors. 


EXAMPLE 2.12. We will use the start system 2? — z = z(z? — 1) = 0 with 
solutions z = 0,+1. Here are the paths we get for the three different target systems: 


(a) 23 — 3.122 4+2z+.05=0 (b) 23 — 2? =0 (c) 22 -2.5z2+.9=0 
| 
|) re 1 ae 
4 -2 
Of- 4 

-6 
a, * -1 


-2b d 2 -10 . 
00 O02 04 O06 08 1.0 00 O02 04 06 O08 1.0 00 O02 04 06 O08 1.0 


e In (a), the paths go from 0, +1 to the roots of z° — 3.12? +2z+.05 = 0. This 
is as good as it gets. 

e In (b), two paths meet when t = 0, which reflects the fact that 0 is a double 
root of z° — z? =0. 

e In (c), one of the paths goes to oo, which makes sense since the target system 
is the quadratic 2? — 2.5z+.9 = 0. 
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It may seem odd to claim that (b) and (c) represent “good” behavior. But in 
practice, it happens often that start system has more solutions than the target 
system, so that some solutions must either coincide or go to co as t > OF. <- 


EXAMPLE 2.13. With the same start system, we now give examples of some 
bad behaviors that can occur: 


(a) 23 — 3.122 +2z=0 (b) z3 — 3.122 + 2z—.05=0 | (c) -223-—5z2+4z+1=0 
3 3 : 10 : 


2 2 
ee ne 5 
4 1 
0 0 
-5 
1 = : 


-2 -2 . -10 1 
00 #02 04 O06 O08 1.0 00 02 04 O06 O08 1.0 00 O02 04 06 O08 1.0 


e In (a), two paths cross when ¢ = 2/3, which means that the IVP fails to have 
a unique solution. This makes it harder to follow paths numerically. 

e In (b), as we go from 1 to 0, two paths meet as t > 0.739707, and neither 
can be continued over R. 

e In (c), the deformed system drops degree when t = 2/3, which means that 
one path goes to co as t > 2/37. 


These problems arise because the deformed system fails to be generic for all t > 0. 
In (a) and (b), H(z,t) acquires a double root, while in (c), it drops degree. <-> 


We can avoid the problems encountered in Example by working over C 
and using the gamma trick. To explain this, note that the homotopy isa 
line segment connecting G(z) to F(z). Bad things happen when the line segment 
intersects loci where the degree drops or multiple roots exist. These bad loci are 
defined by algebraic conditions, which over C have real codimension at least two. 
Since a line segment has dimension 1, we have a good chance of missing the bad 
loci provided we wiggle the line segment a bit. 

The gamma trick achieves this by replacing the line segment [0,1] with the 
circular arc in the plane defined by 


yt 
t) = , te (0,1). 
= aH 0) 
where we assume y € C \ R (to ensure that the denominator never vanishes) and 
|y| = 1 (for numerical reasons). Since 7(0) = 0 and 7(1) = 1, we get the new 
homotopy 
(2.14) H(z,7(t)) = r(t)G(z) + (1 — T(t) Fe) 


that connects F(z) (when t = 0) to G(z) (when t = 1). Lemma 7.13 of 
ensures that with finitely many exceptions, this homotopy will avoid the bad loci. 

Another advantage of working over C is that for the target system —2z3 —5z?+ 
4z +1 =0 from (c) in Example 2.13] we can replace the start system 23 — z = 0 
with z3+1 = 0, whose roots are the negative cube roots of unity. This is good from 
the numerical point of view since they are equally spaced around the unit circle. 
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Then picking y = .4+ .77i gives the following splendid picture from p. 26]: 


t=1 


Making Homotopy Continuation Work. We know that with high probability, 
a randomly chosen y with |y| = 1 will avoid the bad loci. But there are many other 
issues that need to be addressed in order for homotopy continuation to be a useful 
tool for solving equations: 


e Given the target system F(z) = 0, explain how to pick a suitable start system 
G(z) =0. 

Prove that all isolated solutions of F(z) = 0 are found in this way. 

Decide if a path goes to oo. 

Understand the endgame as t + 0°. 

Understand condition numbers and adaptive precision. 

Decide if two paths give the same solution (multiplicity). 

Handle positive dimensional components of the solution variety. 

Handle systems where the number of variables doesn’t equal the number of 
equations. 


One could write a book about this, and in fact there are two books and 
that go into great detail about the above issues. There is also the book Condition 
devoted to the study of condition numbers. 

In addition to a firm theoretical foundation, we are fortunate to have excellent 
software that implements homotopy continuation: 


BERTINI [20], named for the Bertini Theorems. 

PHCpack [365], for Polyhedral Homotopy Continuation package. 
NAGM2 [258], for Numerical Algebraic Geometry for MACAULAY2. 
HOM4PS [78], for Homotopy for Polynomial Systems. 
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Higher Dimensional Components. The solution set of a system of equations 
breaks up into irreducible components of various dimensions. So far, we have 
assumed that each component is a point (the zero dimensional case), which can be 
represented numerically by an approximation of the point and found (for example) 
by homotopy continuation. 

Now suppose that an irreducible component V of the solution set has positive 
dimension. How do we find V numerically? How do we represent V numerically? 
One way would be compute some sort of “generic point” of V. In Chapter [I] we 
discussed several notions of generic point. Suppose that V C C” is defined by a 
prime ideal p C R = C[x,...,2»]. Recall from Section [1.2] that we encountered 
three types of generic point: 


Type Point Description 


Enriques & Chisini | p<@V\W | W = subvariety of points to avoid 
van der Waerden (€1,---,€n) |& =a +p e R/p=C[V] CC(V) 
Grothendieck p € Spec(R) | V = V(p), p prime ideal 


The last two entries in the table give full information about V but can’t be done 
numerically. The first makes sense numerically (compute a random point of V), 
but a single numerical point gives little information about V. One exception is 
V =C", where a random point is useful—for n variables, make a random choice 
for each variable. 

To apply this to an arbitrary irreducible variety V C C” of dimension & and 
degree d, we will use the algebra and geometry of Noether normalization. 


Noether Normalization. For V as above, Noether normalization (see, for exam- 
ple, [104] Chapter 5, §6]) tells us that if we pick generic linear forms L1,..., L, in 
L1,.-+,Upy, then C[Ly,...,L,] C R= Clay,..., 2] induces an inclusion 

(2.15) Cify,...,L4] > R/p = C[V] 

such that C[V] is integral over C[Li,..., £,]. Here, R = C[x1,...,2,] andp =1(V), 
and we write (2.15) as C[LZ1,...,£] C C[V]. Geometrically, this inclusion is the 
map on coordinate rings induced by the linear map 

(2.16) m:V—+C*, a(p) = (Li(p),---,Le(P)). 


Furthermore, being an integral extension of rings tells us that (2.16) is a 
finite morphism of varieties. 


Monodromy. The inclusion (2.15) and the morphism (2.16) lead to interesting 
groups as follows. Recall that V has dimension k and degree d. Then: 


Algebra: We get a field extension Geometry: We get a finite fiber t~!(q) 


C(Li,..., Lx) C C(V) of degree d. = {p1,...,pa} for qe C* random. A 
The minimal polynomial of a generic loop ¥ at q lifts to a path in 
primitive element of this extension V with 4(0) = p;, V1) = poi), 7 € Sa. 
has the Galois group G C Sq. This gives the monodromy group G C Sq. 


The wonderful fact is that these groups are equal up to conjugation, as proved 
by Harris in 1979 [187]. Since the Galois group of an irreducible polynomial acts 
transitively on its roots, we see that the monodromy group acts transitively on the 
points of the fiber 7~1(q). 
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Path lifting can be done numerically, which gives a numerical method to com- 
pute the monodromy group. 


Witness Point Sets. For our irreducible component V C C” of dimension k and 
degree d, random linear forms L;,...,L, give a finite morphism 7: V > C* of 
degree d. For a random q € C*, we have 


x '(q) =VOL, L=random affine subspace of codimension k in C”. 


The set VOL has d points and comes equipped with a G C Sg. The points of VOL 
and the group G can all be computed numerically. 

We call VN L a witness point set of V. This is the numerical replacement for a 
generic point of V. Unlike the algebraic setting of van der Waerden or Grothendieck, 
a witness point set doesn’t directly allow us to recover V but still contains a lot 
of useful information, including dimension, degree, and monodromy. Furthermore, 
by computing multiple witness point sets (this is the idea of sampling), one can 
sometimes build up a vivid picture of V. You will learn more about sampling in 
Section 

However, our discussion so far is unrealistic, since in practice we don’t know 
the irreducible components of the solution set in advance. So now suppose that V is 
the solution set of a system of polynomials. The goal is now to find a witness point 
set of each irreducible component of V. This is what is called numerical irreducible 
decomposition. 


Computing a Numerical Irreducible Decomposition. Given a system of 
equations, the dimension of the solution set V C C” is the largest integer k such 
that VM L 4 @ for a random affine subspace L C C” of codimension k. This can 
be determined numerically, so we can assume that we know the dimension k of V. 

Noether normalization still applies in this situation, so that if DZ is defined by 
[, = -:- = Ly = 0, then we get a finite morphism 7 : V > C* as in (2.16), 
giving a candidate witness point set m~'(q) = VN L as above. Since L is generic 
of codimension k, it will miss all components of V of dimension strictly smaller 
than k. It follows that VO L is a union of witness point sets of the dimension k 
components of V. 

We still have a monodromy group G in this situation, though it may fail to 
act transitively on VM LZ. The key observation is that the orbits of Gon VN L 
correspond to the irreducible components of V of dimension k. Furthermore, each 
orbit is a witness point set of the corresponding component, and the number of 
elements in the orbit is the degree of the component. Hence we have witness point 
sets for the top-dimensional irreducible components of V. In practice, several tools 
are used to achieve this decomposition, not just monodromy. See Chapter 10] 
for the details. 

An inductive procedure described in Chapter 10] and Chapter 13] 
explains how to go from here to the full numerical irreducible decomposition of V. 
These references also explain how one get information about the multiplicities of 
the components. 


The Bertini Theorems. The methods described above work because there is 
a well-developed theory of what happens when we intersect a variety V with a 
generic affine subspace L. The basic results are due to Eugenio Bertini (1846- 
1933), a student of Cremona. His theorems, originally stated for P” in 1882, used 
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the language of linear systems. Kleiman’s article gives a nice overview of 
what Bertini did and what his results mean in modern language. 

To give a quick overview of the Bertini theorems, suppose that V C C” has 
dimension > 1 and let H C C” be an affine hyperplane. If V is irreducible and H 
is generic, then the Bertini theorems tell us that 

e V4 is smooth outside Veing N A. 

e If dim(V) > 2, then VN ZF is irreducible. 

Also, for arbitrary V and generic H, we know that 

e dim(V N H) = dim(V) — 1. 

e deg(V M H) = deg(V). 

It follow that if a variety V is irreducible of dimension k > 1, then for generic affine 
hypersurfaces H,..., Hz, 


VNH,N---N Hy consists of deg(V) smooth points. 


Since L = H,---M Hx is generic of codimension k, we see that the structure of 
the witness point set V M L is a consequence of the Bertini theorems. Intersection 
with linear spaces is a key part of BERTINI. We now see where the program got its 
name! 


Four Examples. Rather than describe further features of software for numerical 
algebraic geometry, we instead give four extended examples to illustrate how these 
software packages are used in practice. 


EXAMPLE 2.14 (Lagrange Multipliers). Recall that optimizing x? + cryz — z? 


on the sphere x? + y? + z? = 1 in R® leads to the system of equations: 
327 + cyz =- 2x 
cuz = X- 2y 
cay —2z=-2z 
ety? +27 =1. 


(2.17) 


For c = 2, one solution is (a, y, z, A) = (-2, ee 2, —4). However, a program such as 


BERTINI would write this solution using floating point complex numbers: 


-6 .666666666666664e-01 -6.661338147750939e-16 

3. 333333333333339e-01 -6.383782391594650e-16 

6 . 666666666666674e-01 -1.276756478318930e-15 

-1.333333333333333e+00 -2.220446049250313e-16 

—— 

real part imaginary part 
The real parts are reasonable, while the imaginary parts look odd until we realize 
that exponents such as “e-16” are telling us that the imaginary parts are zero! 
There are several ways to solve using BERTINI. When c = 2, the simplest 

way is to use the start system 2? = 1,y? = 1,2? = 1,\? = 1 with 16 solutions. 
BERTINI tracks 24 = 16 paths, which behave as follows: 


8 paths go to 8 nonsingular solutions 
(2.18) 4 paths go to 2 solutions, each of multiplicity 2 
4 paths go to oo. 


This gives 10 solutions, as reported in Example 2.2} 
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For such a simple system, what we did in (2.18) is all that is needed. But by 
looking more deeply, we can get insight into how BERTINI can solve more compli- 
cated systems. Recall from Example[2.2]that the system (2.17) has 12 rather than 
10 solutions when c = 2.1 and c = 2.01. So c = 2 has special features that differ 
from the generic behavior of the system. Furthermore, we expect the generic case 
to be easier to solve since there may be fewer multiple solutions or fewer paths that 
go to oo, both of which can be hard to deal with numerically. 

In BERTINI, this is implemented in a two step strategy: 

e Step1: Solve the system (2.17) with a random choice of the parameter c € C. 

We use the same start system as in (2.18]), so there are still 16 paths to follow. 
But the paths behave differently: 

10 paths go to 10 nonsingular solutions 

4 paths go to 2 solutions, each of multiplicity 2 

2 paths go to oo. 

e Step 2: Solve (2.17) with c = 2, but now the start system is (2.17) with 
the c used in Step 1, and the solutions of the start system are the 14 finite 
solutions found in Step 1. When we follows these 14 paths, we obtain: 

8 paths go to 8 nonsingular solutions 
4 paths go to 2 solutions, each of multiplicity 2 
2 paths go to oo. 
We get the same 10 solutions as in Example 2.18] with fewer paths to follow. 
Step 2 is what BERTINI calls a parameter homotopy (see Chapter 6] for details). 
Going from 16 paths to 14 paths is not very impressive, but the reduction can be 
much more dramatic, as happens in next example. <— 


EXAMPLE 2.15 (The Four-Bar Mechanism). In R?, consider a triangle CDE 
with edge lengths shown below and line segments AD, BC with indicated lengths. 
When we fix the points A, B and allow the triangle to move, the point F traces out 
a nice curve: 


E 


A 6(0, 0) (4,0)0B 


This is a classic four-bar mechanism, with bars AD, CD, BC and AB (not drawn) 
and a rigid body (the triangle CDE) attached to one of the bars. 
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Our first task is to find the equation of the curve traced out by FE. If we set 
C = (u,v),D = (c,s),E = (a,y), then the lengths in the above picture give five 
equations: 
c+ s% = 2? 
(u — 4)? +? = 3? 
(2.19) (wa—cP+u—sP ae 
(a — 0)? + (y— 8)? = (5/2)? 
(x —u)? + (y—v)? = (5/2)?. 
These equations are not sufficient since they ignore the orientation of the triangle, 
which we want to preserve. The vectors 


C—-D=(u-cv-—s), E-D=(4%#-cy-s) 


determine a parallelogram with signed area +6 when we go counterclockwise from 
C—Dto E—D. This gives the equation 


(2.20) det & eo ) =6 


L-C€ ys 

Combining this with and eliminating u, v,c, s gives the sextic equation 
2025—26822—1475x7—271227+4+286424—7682° +642°— 
5724y+8832ary—432027y+1536x%y—192¢4y+5949y?— 

271 2xy?+3680r7y?—15362°y?+19224y? —4320y3+ 

15362y? 38427 y3+816y4—768ry1+19207y4—192y°+64y° = 0. 


(2.21) 


The resulting picture has a surprise: 


The real points of (2.27) consist of two connected components! 


The Other Connected Component. To explain the other component, first 
observe that the equations (2.19) have two symmetries: 

e Flip about the bottom edge of the triangle. 

e Flip about the z-axis. 
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These symmetries are orientation-reversing. If we apply both to the gray triangle, 
we get the dashed triangle shown below. It has same orientation as the gray triangle 


and hence satisfies the signed area equation (2.20). This means that the point E’ 
on the dashed triangle traces out the other component! 
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The Nine-Point Problem. A four-bar mechanism has nine degrees of freedom: 


4: the two points A and B in R? 
3: the three edge lengths of the triangle CDE 
2: the two lengths of the line segments AD and BC. 


It follows that if we specify nine points on the curve, there are now zero degrees of 
freedom. Intuitively, this means that there should be only finitely many four-bar 
mechanisms whose curve goes through the nine points. In 1923, Alt posed the 
nine-point problem of finding a four-bar mechanism whose curve goes through nine 
arbitrarily chosen points in the plane. 

In 1992, Wampler, Morgan and Sommese used homotopy continuation 
to show that up to known symmetries, there are 1442 four-bar mechanisms whose 
curve goes through nine given points in R?. Computing this number was one of the 
first big successes of numerical algebraic geometry. 

The nine-point problem is discussed in the books and [324]. Following 
their approach, we change coordinates so that one of the nine points is the origin 
and replace coordinates x, y in the plane with isotropic coordinates z = x+iy,z* = 
x —iy. Thus z* = Z precisely when x,y are real. Then the eight remaining 
points lead to a system of eight equations f; = --- = fg = 0 in eight unknowns 
a1, 0}, b1, b}, a2, a4, bo, bS. Here, a1, a}, a2, a5 are the isotropic coordinates of A, B, 
while 61, bj, be, b} have more complicated descriptions 5.5.2]. 
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Four Bézout Numbers. When solving f; =--- = fg = 0 by homotopy continu- 
ation, there are four different ways to set up the start system: 


e Each f; is a polynomial of degree 7. When we work in P®, the standard 
start system of degree 7 polynomials would have 7° = 5,764,801 solutions by 
Bézout’s Theorem. This is a lot of paths to track! 

e One virtue of isotropic coordinates is that the variables a1, aj, b1, bf, a2, a5, 
bo, bs in f; can be grouped in several ways. If we use the variable groups 


(2.22) {a1, a7, b1, by} and {a2, 05, b2,b5}, 


then f; has degree 4 in each variable group. This leads to a start system on 
P* x P* with 4,587,520 solutions by Bézout’s Theorem for (P*)?. 

e For the variable groups {a,, b; }, {at, bi}, {a2, bo}, {a4, b5}, f; has degree 2 in 
each group. When we work in P? x P? x P? x P?, this gives a start system 
with 645,120 solutions by Bézout’s Theorem for (P?)*. 

e The polynomials f;,..., fg all have the same Newton polytope P C R®. By 
the LATTICE_VOLUME command in POLYMAKE [9], the normalized volume of 
P is 79,135. This leads to a start system with vol(P) = 79,135 solutions by 
Bézout’s Theorem for the toric variety Xp. 


BERTINI can work in P§, (P+)? and (P?)* but not in Xp, though PHCPack can. 

We ran this example in BERTINI using the origin and eight randomly chosen 
points, with variable groups (2.22). This might seem inefficient (4,587,520 paths to 
track), but by using the BERTINI method called regeneration, the number of paths 
to follow can be dramatically reduced. 

The rough idea of regeneration begins with a start system (g1,...,98), where 
each g; is a carefully chosen product of linear polynomials. But rather than go 
directly to the target system (f1,..., fg), we proceed in eight stages called levels. 
In Level 0, we homotop (g1, £2...,€s) to (fi, @2...,8), where each @; is linear. In 
Level 1, we use the finite solutions of Level 0 and a homotopy for each linear factor 
of gg to find solutions of the start system (fi, 92, ¢3,..., &g). We then homotop this 
to (fi, fo, €3,...,€3). We continue until Level 7, where we finish with the solutions 
of (fi,..-, fg). See 5.4] for the details of how this works. 

For the nine-point problem, regeneration gives the following BERTINI output: 


level! paths | nonsing endpts | total discarded | sing endpts | inf endpts | higher dim’1 | other bad endpts 


total] 152223 


Only 152,223 paths were followed. The key feature of the output is the underlined 
number 8652. This is the number of the nonsingular solutions (possibly complex) 
of the nine-point equations. Because of a known six-fold symmetry for solutions of 
the nine-point problem, it follows that there are at most 


8652/6 = 1442 distinct four-bar mechanisms through 9 points in R?. 


In 1992, the authors of used a precursor of BERTINI written in Fortran on 
an IBM 3081. The computation took 332 hours of CPU time. In 2018, the same 
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computation takes about 60 hours using BERTINI on a laptop and under 2 hours 
using the parallel version of BERTINI on a cluster. << 


EXAMPLE 2.16 (Parameter Estimation in a HIV Model). The 2016 paper Nu- 
merical algebraic geometry for model selection and its application to the life sciences 
by Gross, Davis, Ho, Bates and Harrington discusses model validation, model 
selection, and parameter estimation using tools from numerical algebraic geometry. 

We will focus on parameter estimation in a model that describes how the HIV 
virus infects CD4* T cells and macrophages, which are types of white blood cells 
in our immune system. Table [I] shows the model used in [179]. In what follows, 
we write “T cell” instead of “CD4* T cell” for simplicity. 


Description Interaction Parameter value 
Generation of new T cells 9—>T 10 
Generation of new macrophages 0+ M 1.5 x 107+ 
Proliferation of T cells by presence of pathogen T+ V Ba »>(T+V)+T 2x 10-3 
Infection of T cells by HIV T+V rs qT; 3x 1073 
Proliferation of M by presence of pathogen M+V ay (M+V)+M_ 7.45 x 1074 
Infection of M by HIV M+V a M; 5.22 x 1074 
Proliferation of HIV within T cell qT; es, V+T; 5.37 x 1071 
Proliferation of HIV within macrophage M; ke , V+M; 2.85 x 1071 
Natural death of T cells T 39 0.01 
Natural death of infected T cells ay 2, 0 0.44 
Natural death of macrophages M ass 0 6.6 x 10-3 
Natural death of infected macrophages Mi se 0 6.6 x 107-3 
Natural death of HIV v 2559 ? 


TABLE 1. HIV Model 


The interactions in Table [I] are easy to explain. For example, consider 


PEG rey ar 


The first line is the creation of more T cells when the virus is detected, the second 
line is infection of T cells, and the third line is when infected T cells release more 
copies of the virus. The story for macrophages is mostly similar. One difference 
is that the death rate d2 for infected T cells is much larger than the death rate 6; 
for uninfected ones, while for macrophages, the death rates 63 and 64 are the same 
since macrophages can be long-lived virus reservoirs. The final line of the table 
does not specify a value of the parameter 65. The goal is to estimate 65 using data. 


The Mathematical Model. Table [I] is an example of a biochemical reaction 
network. Such networks will be studied in detail in Chapter [5] To model the 
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network in Table[i] we introduce variables 

T = uninfected T cells 

T; = infected T cells 

M = uninfected macrophages 

M; = infected macrophages 

V =HIV virus population. 
Using the Law of Mass Action (to be explained in Chapter 5), we get a system of 
ordinary differential equations: 
£T = 5, +kTV — koTV — 5,T 
2T; = kgTV — b:T; 
£M = s2+k3MV —kaMV — 63M 
4 Mj = kaMV — 64M; 
£V =ksT; + keM; — OV. 


(2.23) 


Steady States. Stable populations occur when 

d d d d d 

ag = gH a = t= a Ho. 
Over C, the polynomials on the right-hand side of (2.23) define the steady state 
variety 


V°COxC=C, 
where the variables T,T;, ,M;,V live in C® and the unknown parameter 6; lives 
in the final C. In [179], MAcAULAY2 is used to decompose V** into irreducible 
components 
V* =Vin UVe, 

with main component Vin = V(fi, fa, f3, fa, fs), where 

fi = 5742 M — 2453 M; — 130500 

fo = 2599087; — 46607 M7; + 48400006; — 20200500 
(2.24) fs = 17721T + 46607 M; — 484000065 + 2479500 

fa = 484000 V 65 — 184547 M; + 48400006; — 20200500 

fs = 2453 Mi V — 72600 M; + 130500V, 
and extinction component 

Ve = V(V, M;, 11M — 250, T;,T — 1000). 
For the symbolic computation, the floating point numbers in Table[I]were converted 
into rational numbers. 
We like the extinction component because the virus is gone (V = 0) and the 

T cells and macrophages are all healthy (TJ; = M; = 0). But medically, the main 
component is more interesting, since a positive solution of fj; = --- = fs = 0 


indicates a stable HIV infection that coexists with stable T cell and macrophage 
populations (infected and uninfected). 


Estimating 6; From Data. Suppose we have data 


(T, T;, M, M;, V) = (319.5, 46.85, 81.09, 136.67, 21.21). 
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We will use this vector, denoted data, to estimate the parameter 65. A naive 
approach would to substitute (T,7;,M,Mi,V) = data into (2.24) and solve for 
65. This is indeed naive, since the substitution results in an inconsistent set of 
equations. Remember that we rarely have exact solutions in the real world! 

It follows that the main component V,, C C® is disjoint from the data line 
L CC® given by 


L = {(data, u) | u€ C} = {data} x CC C®. 


We find the best estimate for 65; by minimizing the weighted distance 

pm (eae (T,— 46.85)? + (M—81.09)" (M4, 136.67)* cea + (ds—u)? 
Oy 2 o3 o%4 o% Ue * 
between a point (T,7;,M,M;,V,65) € Vm and a point (data,u) € L. In this 
formula, the denominators ¢? come from Gaussian distributions added to create a 
statistical model. Following [179], we assume o? = 2 for all i. 


Finding the Global Minimum. Many standard numerical methods will find a 
local minimum of d?. Using numerical algebraic geometry, we will find the point 
(T,T;,M, Mi, V,65,u) € Vin X C at which d? attains its global minimum. The value 
of 65 at this point gives the desired estimate. 

Since V,, is defined by fi =--- = fs =0, this is constrained minimization. By 
Lagrange multipliers, the minimum of d? on C® x C with f, =--- = fs = 0 occurs at 
a critical point of d? , Ai fi. Solving this in BERTINI gives 16 solutions, three of 
which are real. Evaluating d? at the real solutions gives the estimate 6; = 2.99876, 
which as noted in is close to the known value é5 = 3. <P 


EXAMPLE 2.17 (Maximum Likelihood Estimation in Phylogenetics). Our final 
example is from the 2017 paper Geometry of symmetric group-based models by 
Korta and Kubjas [237]. In the introduction, the authors write 

The maximum likelihood estimation aims to find parameters that maximize the 

likelihood of observing the data for the given phylogenetic tree and phylogenetic 

model. In practice, numerical methods are used to find the MLE. However, as 

MLE is a non-convex optimization problem, there is no guarantee of outputting 

the global optimum. Since phylogenetic models are not necessarily compact, the 

MLE might even not exist. 


One of the examples in is a phylogenetic model due to Cavender, Farris and 
Neyman based on the tripod Ky,3. The paper uses numerical algebraic geometry to 
show that in this model, there are data vectors for which the MLE does not exist. 


The Symmetric Group-Based Model. We have a root and three leaves, with 
probabilities 7,7 at the root and transition matrices P,, P2, P3 along the edges: 
(70,71) 


(2.25) Pr PS Ps P= 


Group: 
—2t; — Z/2Z = {0,1} 


In this model, the root represents an extinct species with an unknown distribution 
(70,71) of some binary character. For example, DNA is built from four bases 
A, T, G and C. We get a binary character by looking at the distribution 7 of 
purines (involved in A and G) and 7 of pyrimidines (involved in C and T) in 
the original species. This species has evolved into three species represented by 
the leaves, and the transition matrices P;, Pz, P; (also unknown) represent how the 
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binary character is transmitted to each new species. The parameters t1, tz, t3 record 
how much time has passed. 

We only have access to the three living species. Checking the binary character 
for each of these leads to eight possibilities. For i,7,k € {0,1}, let pij, denote the 
probability of observing character 7 in the first species, 7 in the second, and ’ in 
the third. The model tells us that 


Pijk = To (Pt )oi(P2)o0; (P3)ox ap 1 (Py) 1i(P2)1;(Ps)ik, 


where rows and columns of P,, P2, P3 are indexed by 0,1. The pix satisfy the 
obvious restrictions Pijk = 0 and Ppooo + +++ + P1114 = 1. 

In [237], the authors show that the p,,;;, that occur are characterized by equa- 
tions and inequalities as follows: 


1 linear equation poop +--:+pi11 = 1 
3 quadratic equations 


(2.26) 8 linear weak inequalities pj, > 0 
7 strict linear inequalities 
4 quadratic weak inequalities. 


The full set of equations and inequalities are written out in Example 2]. 


The Maximum Likelihood Estimate. Given a data point (woo0,---, 111), the 
ML estimate is the point (pooo,---,P111) of the model that “best explains” the data. 
Here, “best” means the point that gives the global maximum of the function 


O27) {) 2 


i,j,.kE {0,1} 


on the model. Standard numerical methods in statistics give a local maximum that 
is not guaranteed to be the global maximum. This is where numerical algebraic 
geometry enters the picture. 


The Maximum Likelihood Degree. For the model (2.25), we get a complex 
algebraic variety by taking its Zariski closure. This means using the equations in 
(2.26) and ignoring the inequalities. The number of complex critical points of the 
likelihood function on the Zariski closure is the ML degree of the model, 
which equals 92 in this case. 

However, the weak inequalities in (2.26) mean that the model has a boundary. 
Since a global maximum can occur at a boundary point, we need to take these 
into account. The boundary may have several components, depending on which 
inequalities become equalities. So there are multiple Zariski closures to consider, 
each with its own ML degree. 


An Example With No MLE. Consider the data 
(uoo0,-++;Ui11) = (100, 11, 85, 55, 56, 7, 75, 8). 


For these values of u;;,, the MLE estimate is maximal value of (2.27) on the model 
(2.26). The analysis in Example 8] involves 44 varieties, one coming from 
the equations in (2.26) and the other 43 coming from boundary components where 
various inequalities become equalities. 
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For each of these 44, the authors set up a Lagrange multipliers problem that 
they solved using PHCpAck. Of the 167 critical points that were found, 49 were 
real and positive. Plugging into (2.27) gives candidates for the ML estimate. 

For a boundary component where some inequalities become equalities, the other 
inequalities are ignored in computing the Zariski closure. Hence MLE candidates 
on such components need to be checked against the remaining inequalities. For the 
candidates coming from the seven largest values of (2.27), here is what happens: 


e In four cases, some of the inequalities in (2.26) become negative. It follows 
that these candidates lie outside the topological closure of the model and 
hence have no meaning. 

e In two more cases, weak inequalities derived from become negative. 
So these candidates also lie outside the topological closure of the model. 

This leaves the seventh top candidate, which also fails to satisfy (2.26). But 

unlike the previous six cases, it is contained in the topological closure of the 
model. The point corresponds to root distribution (79,71) = (1,0) and time values 
(th, ta, t3) = (.665, +00, .262). Thus 


point for (.665, +00, .262) = lim point for (.665, tz, .262). 
a to 
not in the model in the model 


It follows that the MLE does not exist as a point of the model. 

As the authors of emphasize, local methods would never be able to reach 
this conclusion—one needs a global approach based on numerical algebraic geome- 
try to discover the subtle behavior of the model (2.25). < 


Final Comments. The goal of this section was to introduce homotopy continua- 
tion, with many omissions, and give four examples of numerical algebraic geometry 
in action. But there is much more that can be done with these methods, as you 
will see in the next section. 


2.3. Applications of Sampling in Numerical Algebraic Geometry 
by Jonathan Hauenstein 


Many algorithms in numerical algebraic geometry start with a witness point 
set VOL for an irreducible variety V C C” of dimension k& and degree d, and utilize 
homotopy continuation to deform the affine subspace L. This idea has already 
been implicitly utilized in various computations in Section 2.2] such as computing 
monodromy groups, deforming parameters in Example[2.14] and using regeneration 
for solving in Example This section will explore other numerical algebraic 
geometric algorithms to obtain additional information about V. 


Membership Testing and Local Multiplicity. When V = V(G) for some 
polynomial system G’, a given point 2* is contained in V if and only if G(a*) = 0. 
Witness point sets provide an alternative approach for deciding membership in V 
without computing such a polynomial system G which also yields the local multi- 
plicity [325]. Recall that L is a general affine subspace of codimension k and let Lz 
be a general affine subspace of codimension k passing through «*. Using homotopy 
continuation, one simply homotops VN L to VM L,«. Then, «* € V if and only if 
x* €VOL,~. The local multiplicity of «* with respect to V is equal to the number 
of homotopy paths which end at x* and is positive if and only if «* € V. 


76 2. NUMERICAL ALGEBRAIC GEOMETRY 


EXAMPLE 2.18. Let V C C? be the quartic curve defined by 
ri +25 — (41 + £2)(a1 — £2)” =0 


and L C C? be the affine linear subspace defined by 42, + 9x2 = 3. The witness 
point set V  L consists of 4 points. In order to compute the local multiplicity at 
x* = (0,0), consider the linear slice LD,» defined by 42; + 9%2 = 0. The homotopy 
deforming VM LZ to VM Lz» yields one path ending at a point which is not 2*, 
namely (7605/6817, —3380/6817) = (1.1156, —0.4958), with the other three ending 
at x* showing that the local multiplicity of z* is 3. The following picture shows 
the 4 paths tracked along the quartic curve as L is deformed to LZ,» along V: 


r 


<- 


Witness Point Sets of Projections. The geometric counterpart to elimination 
theory described in Chapter [IJis to compute images of varieties under projections. 
For the projection map 7(x) = (#1,...,x%¢), Chevalley’s Theorem provides that 
m(V) C C* is a constructible set so that the closure under the Euclidean and 
Zariski topologies are equal, i.e., 7(V) is a variety. In particular, if m = dimz(V) 
and M is a random affine subspace of C’ of codimension m, we aim to compute a 
witness point set 7(V)N M =7(V)N M from VOL using the approach of [195]. 
Since k = dimV > dimz(V) = m, we can select L, to be a random affine 
subpace of codimension k inside of the codimension m affine space m~'(M) CC”, 
Using homotopy continuation, one simply homotops VM L to VM L,. Then, 


(VA Le) =m(V)AM 


is a witness point set for 7(V) thereby yielding the degree of 7(V). Moreover, the 
generic fibers of V + a(V) have dimension k —m and degree |VNL,|/|m(V NO L,)|. 


EXAMPLE 2.19. Consider the curve C = 7(V) C C? where V C C? is the cubic 
curve defined by 


2(1 +42) — (1-22) 


y(1 + *) _ ot = 0 and n(x, y,t) = (2,4); 


Since deg V = 3, suppose that we start with a witness point set VM L consisting of 
3 points where L C C® is plane defined by 42 — 2y +t = 3. We aim to compute the 
witness point set CM where M C C? is the line defined by 4% — 2y = 3. Since 
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dim C = dim V = 1, we simply have L,; = 7~!(M) = M x C. The homotopy from 
VOL to VOL, tracks 3 paths of which only 2 converge in C3 to the points 


(ae 342/11 22371) 


(2.28) (x,y, t) = 


10” 10 7 


Since the images of these points under 7 are distinct, this shows that degC’ = 2 
and V + C is generically a one-to-one map. The following pictorially summarizes 
this computation: 


homoto 
PY 


| 
—— 


<< 


One can extend the membership test described above to decide membership 


in 7(V) [£93]. The following describes an application from relating to 2 x 2 
matrix multiplication. The book provides extensive details relating matrix 
multiplication, tensors, and complexity. 


EXAMPLE 2.20. Multiplication of 2x 2 matrices is a bilinear map C* x C+ > C# 
which can be represented as the following tensor in C+ @ C4 @ C?: 


T = ay, ® by ® C41 + a42 ® bg] ® C41 +411 ® yg @ CyQ + GyQ ® bag ® yg + 
a21 ® by, ® C21 + azz ® bai ® C21 + 21 @ by12 ® C22 + a2 ® bog ® 22. 


The 8 terms listed in T correspond with the 8 scalar multiplications needed to 
multiply to 2 x 2 matrices using the standard definition of matrix multiplication. 
For m > 1, consider the variety 


Om = {> £;(a) ® £(b) @ é3(c) | & linear form on otf 
i=1 


The minimum m such that T’ € oy is called the border rank of T. The definition 
of T above shows that the border rank is at most 8. Strassen showed that 
the border rank of T is at most 7 via the decomposition 


T = (a11 +422) ® (b11 + b22) ® (C11 + €22) + (21 + a22) ® b11 ® (C21 — 22) 
+ ay1 ® (b12 — b22) ® (C12 + C22) + a22 ® (b21 — b11) ® (11 + €21) 
+ (a11 + a12) ® bo2 ® (C12 — C11) + (G21 — 11) ® (b11 + b12) ® C22 
+ (a12 — a22) ® (ba1 + be2) @ c11. 
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Thus, the border rank of T is 7 if and only if T ¢ og. By viewing a¢ as the closure of 
an image under a projection map, numerical algebraic geometry was used in to 
show deg og = 15,456 by computing a witness point set for og. Then, membership 
testing using this witness point set showed that T ¢ o¢. << 


Two questions immediately arise from this last example. First, how can one 
verify the degree was computed correctly using numerical algebraic geometry? Sec- 
ond, can one use witness point sets to determine polynomials which vanish on a 
variety? These questions are addressed over the next four topics. 


Trace Test. Given a subset W of a witness point set VM LZ where V is irreducible, 
one aims to decide if W = VNL. One approach for deciding equality is via the trace 
test first described in [326]. Let M; be a family of parallel linear spaces linearly 
parameterized by t with Mo = L. The idea is to consider the sum of the solution 
paths starting at the points in W defined by VM M;. One has a complete witness 
point set if and only if this sum of the solution paths is a linear function of ft. 


EXAMPLE 2.21. To demonstrate the essence of the trace test, consider the 
irreducible bivariate polynomial 


f(a,y) = a? — xy + 2y? +32 —2y4+1 


so that f = 0 defines an irreducible curve C C C? of degree 2. The intersection 
of C' with the parallel family of lines MM; defined by y = 3x + t is described by 


(2.29) y=3a+¢ and f(x,32+t) = 162” + (11t—3)a + (24? — 2¢+1) =0 


yielding two solution paths 


1 
(x(t), y (1) = gp (8-1 J Tt + 62t —55,9-t+3 —7t? + 62t— 55). 


Clearly, both solution paths (x+(t),y:(¢)) and (#_(t),y_(t)) are nonlinear (and 
not polynomial) in t showing that deg C > 1. However, their sum 


(04) +2-O,y() +¥-O) = B@B-1b9-4) 


is linear in t showing that deg C' = 2. 

The value of 7+(t) + v_(t) is derived directly from f in by simply ob- 
serving that this sum is equal to the negative of the coefficient of x (linear in t) 
divided by the coefficient of x? (constant) by writing 


f(x, 3e+t) = 16(2—2,(t))(a—a_(t)) = 16(2? — (24.(t)+2_(t))-2+24(t)-2_(d)). 


This sum is called the linear trace which arises from the linear elementary symmetric 
function e;(a,b) = a+ 6. One can also provide a similar statement using any 
nonconstant elementary symmetric function. For example, e2(a,b) = a- b yields 


1 

(v4 (t)-2_(),94() -y-() = 56 

which is quadratic in ¢ and again shows that deg C = 2. Similar with x, (t)+2_(¢), 
the quadraticity of x+(t)-a_(t) is obtained from f in (2.29) where the value is equal 
to the coefficient of 1 (quadratic in t) divided by the coefficient of x? (constant).<> 


(2? — 2¢ + 1,4? —9¢ +9) 


From a numerical computational standpoint, one decides linearity of an analytic 
function by evaluating it at three (or more) points and testing linearity of the points. 
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EXAMPLE 2.22. The following illustrates using numerical algebraic geometry 
to compute the degree of the hypersurface V C C!° which is the closure of the 
bivariate cubic polynomials that can be written as a sum of three cubes of linears: 


i=l 


v= facet 


3 
ag + ax + agy +--+ + ay? = S- (bio + dire + baad} . 


In other words, V = ¢(C®) where 


big + B39 + B30 
3(Bajbia + bayer + b3yba1) 
3(bejbin + baybon + baybaa) 
3(biob{1 + b20b51 + b30b31) 
ob11b12 + b29b21b22 + b39b31b32) 
3(brobjz + b20b52 + b30b32) 
bi, + 63; + b3, 
3 b?,bi2 b3, bea b3,b32) 
3(b11b7, + b21b32 + b31b32) 
big + b3o + 30 


We consider intersecting V with the parallel family of lines M; parameterized by 


1 0 0 
—2 1 —1 
—2 -1 —2 
—3 —2 —3 
a=s = + : + : = 
—3 1 —2 
13 1 —2 
11 2 —3 
13 3 2 
—2 1 3 


Therefore, if s = do is linear in t, then all coordinates are linear in t. For generic tf, 
homotopy continuation finds that V 9 M;, consist of 4 points. The following table 
shows the values of s = ag (rounded to 4 decimal places) for the 4 paths, say 
si(t),...,84(#), for some values of t: 


t | s(t) so(t) a(t) sa(t) | D5 s(t) | Dh sald) 
—1 | —0.4157 —0.3498 0.3421 1.6696 | —0.4234 1.2462 
O | —0.5646 —0.1876 0.2573 1.5219) —0.4948 1.0271 
1 | —0.7269 —0.0025 0.1492 1.3883 | —0.5803 0.8080 


From this data, we test for linearity as follows. If a function r(t) is linear, then 
(2.30) r(1) + r(-1) — 2r(0) =0. 


For r(t) = S7°_, si(t), we have r(1) + r(—1) — 2r(0) = —0.0141 4 0 showing that 
S>*_, si(t) is not linear in t. However, for r(t) = = s;(t), (2.30) holds. By the 
randomness of M;, the points t = —1,0,1 were general so we can conclude that 
ae s;(t) is linear in t. Therefore, the trace test concludes deg V = 4. << 
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In practice, the coefficients of M; are chosen with random complex numbers 
and the trace test is numerically performed using various precisions to provide a 
robust test of linearity. 


Estimating Degrees. The trace test is used to determine if a complete witness 
point set has been found. If the purported witness point set is not complete, 
the trace test does not provide any indication of how many points are missing. 
A statistical estimation of the degree can be used for such an indication. The 
following describes an estimation approach based on subset sampling with 
other estimation methods provided in [18][292]. 

Suppose that W is a known subset of a witness point set VM where the aim is 
to approximate d = deg V = #VNL. An estimation of d is obtained by computing 
a uniformly at random subset Q C VOL of size 1 < € < d and comparing the sizes 
of W, Q, andWnQ. If WO QF9, the maximum likelihood estimate for d is 


#W = #W-#Q 
aya ~ 40 


The main denominator in the first expression is the ratio of repeats which is an 
estimation of the relative size of W to VM L. Hence, if the ratio of repeats is large, 
this suggests that W almost contains all of VM LZ. Conversely, if the ratio of repeats 
is small, then there are many points in VM L not contained in W. 

The main challenge to employ such a statistical estimation is the ability to 
actually generate a uniformly at random subset Q of VN L. In [192], a monodromy 
loop, which is discussed in Section 2.2] starting with some subset of W is used to 
generate the set Q. 


EXAMPLE 2.23 (Estimating the Nine-Point Problem). In Section [2.2] regener- 
ation was used to solve the nine-point problem for four-bar linkages. This compu- 
tation confirmed the result from of 1442 distinct four-bar mechanisms. We 
start with one known solution. At each stage, a monodromy loop, which moves 
the nine points in a loop, is performed starting with all known solutions so that 
#W = #Q. The following summarizes the number of known solutions, number 
of repeats, and an estimate of the total number of solutions when the number of 
repeats is not zero at each stage in our experiment. 


stage ||] 1] 2|3]4]5 | 6 | 7 8 9 | 10 ra 12 13 14 | 15 
FEW L 2 4 8 16 32 64 125 239 429 740 1108 1364 1436 1442 
F#WOQ 0 0 0 0 0 0 3 11 49 118 372 852 1292 1430 1442 
estimate 1365 1420 1166 1560 1472 1441 1440 1442 1442 
This confirms the number 1442 computed in Example 2.15] < 


Vanishing Polynomials. Witness point sets provide a geometric description of a 
variety. In some instances, algebraic information in the form of polynomials that 
vanish on the variety is also of interest. This can be accomplished using sampling 
and interpolation. 

When searching for vanishing polynomials with integer coefficients, one ap- 
proach described in is to utilize an integer relation algorithm such as LLL 
and PSLQ with one general point. Given a vector z € C*, integer 
relation algorithms seek a vector a € Z such that a-z = 0. By taking z to be 
a vector obtained from evaluating polynomials p;(a),...,pe(x) at a general point 
which can be computed to arbitrary accuracy via adaptive precision computations, 
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one is searching for coefficients a € Z such that ayp;(x) + --- + agpe(x) vanishes 
on the variety. 


EXAMPLE 2.24. Suppose one aims to compute the polynomial vanishing on the 
curve C = 7(V) C C? in Example [2.19] Since the points in (2.28) were defined by 
intersecting C with the line 4% — 2y = 3 having integer coefficients, integer relation 
algorithms would be influenced by this line. Thus, we simply homotop one point 
to CMM’ where M’ is defined by 42 — 2y = 2\/2 to obtain the point 


(2.31) bi 5(2V2 +V3,2V3—- Va) Ec. 


Even though we know that no linear polynomials vanish on C' since CC C? is 
an irreducible curve of degree 2, we use this as an illustrative case when searching 
for a nonexistent relation. By taking z = (1,2,y), the PSLQ algorithm in MAPLE 
returns the integer relation vectors 


a = (—102543, 80992, 69932) and a = (—6837579893, 2055056255, 12105959021) 


using 16 and 32 digits, respectively. If we increase the number of digits to 200, one 
obtains 


a = (31626459413779750241529732620573604870665466111587708134149687142, 
—1132884181626097544218282426155782268701314575692937686049386824651, 
2443236354468731668760433223286178277728450296887726023578994829436) 


which indicates that searching for a linear integer relation is futile. 
Next, we search for the quadratic polynomial which defines C' using the vector 
z=(1,2,y,x7,xy,y’). When using 16 digits, one obtains 


a=(—1,0,0,1,0,1) corresponding to a? +y?—1=0. 


In this case, one can easily verify that this is the correct relation by substitution 
since C' is parameterized by 


1-?? Qt 


=. d =, <C 
i‘. 1+ ¢? = y 1+? 


An alternative to using one point to search for integer relations is to utilize many 
points via classical interpolation with numerical linear algebra. As above, suppose 
that pi(x),...,pe(x) are given polynomials and one aims to obtain all a € C* such 
aypi(x)+:--+agpe(x) vanishes on a given variety V. This is equivalent to computing 
the null space of the matrix 


[ pile) = pez), weV ] 


Since the number of columns is @, one only needs at most @ general points of V to 
determine the proper null space. In fact, one has an easy to check stopping criterion 
since adding a row evaluated at a general point either reduces the dimension of the 
null space by one or one already has computed the correct null space. 


EXAMPLE 2.25. We again aim to compute polynomials vanishing on the curve 
C =7n7(V) C C? in Example 2.19] First, linears vanishing on C correspond with 
a € C® such that a; + agx + azy = 0 on C. Using the three known points from 
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(2.28) and @.31), we have the matrix 


1 x 
(64+ V11,3—2V11) |} 1 6+Vi1 S21 
75 (6 — V11,3+2V11) 1 S—vil ave 
1 (272+ V3,2V3 — V2) ||1 22EVE ava-va 


Since the resulting 3 x 3 matrix has full rank, we again confirm that no linear 
polynomials vanish on C. 

To search for quadratic polynomials using the basis 1, 2, y, x7, cy, y?, we com- 
pute additional points on C by moving the intersecting linear space. For example, 


1 x y x xy y? 
(6 he V11,3 = 2/11) 1 S411 s-2V/71 AT12/T1 4-2/1 o8—1aV'IL 
1 (6= in ee 11) 1 6—vil S377) A= 1a =4+9viT eset 
L(2v2+ V3,2v3—y2) 1 24S WEE Ebay ave 
1 2/2-V3 2/3-V2 11-4V6 2-3V6 1444V6 
g(2V2 — V3, -2V3 — V2) |] 1 5 5 35 35 25 
(0,—1) 1 0 —1 0 0 1 
1 24 -7 576 —168 49 
25 (24, =") 1 25 25 625 625 625 
results in a 6 x 6 matrix of rank 5 with the null space spanned by the vector 
a = (—1,0,0,1,0,1) as expected. < 


EXAMPLE 2.26. The following degree 4 homogeneous polynomial that defines V 
in Example 2.22] was obtained via sampling and interpolation: 
144aga3a7a9 — 48apa3a2 — 216aga4qagag + 24aga4a7ag + 144a9a5agag 
- 48apa5a2 - A8azazag + 16a7az2 + 144a,a2aga9 — 16a,a2a7ag + 24a, a3a4a9 
— 16a ,a3a5ag — 8a,azag + 24a,a4a5a7 — 48a, a2a¢ - A8a3.agag + 16a3a? 
— 48a2a3a9 + 24aga3a4ag — 16a2a305a7 — 8a2a7za7 + 24a2a4a5a¢ + 16a2a2 
— 8a3aja5 + a4. 


It is easy to prove that it vanishes on V via substitution since V = ¢(C?). << 


Vanishing Polynomials on Witness Point Sets. A shortcoming of these ap- 
proaches for recovering polynomials is the dimension of the vector space of poly- 
nomials under consideration. For example, the dimension of the vector space of 
polynomials in n variables of degree at most r is ("'"). One way to lighten the 
workload is to consider the simpler problem of computing polynomials that van- 
ish on a given witness point set which are restricted to the linear space used to 
construct it. In such a situation, one always obtains upper bounds in that every 
polynomial which vanishes on the variety restricts to a polynomial which vanishes 
on the given witness point set. If the variety is arithmetically Cohen-Macaulay, i.e., 
the Krull dimension of the coordinate ring is equal to its depth, then there is an 
equivalence between polynomials which vanish on the variety and vanish on a wit- 
ness point set. In fact, one may turn this around and use it to test the arithmetically 
Cohen-Macaulayness of a variety [I17]. 


EXAMPLE 2.27. Consider the twisted cubic curve C = ¢(C) and the quartic 


curve D = 0(C) where 
g(t) = (t,t7,t°) and @(t) = (t,t, *). 
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For L C C® defined by z = 3x — 3y +1, we aim to compare quadratic polynomials 
f(x,y, 2) vanishing on C and D with quadratic polynomials g(a, y,3x — 3y + 1) 
restricted to LD vanishing on CN L and DN L, respectively. 
The set of quadratic polynomials vanishing on C is the three-dimensional vector 
space spanned by 
fi=ay—2, fe =2-y, fs =az—y. 
Restricting to L also yields a three-dimensional vector space spanned by 
gi =cy—324+3y—-1, go =2’?-y, 93 = 307 —3ay—y’?4+c. 


Since C is arithmetically Cohen-Macaulay, this space is exactly the space of qua- 
dratic polynomials restricted to L that vanish on the witness point set 


CNL = {(1,1,1), (-2+ V3,7 + 4V3, —26 + 15V3)}. 


In contrast, the set of quadratic polynomials vanishing on D is the one-dimen- 
sional vector space spanned by f; with corresponding g; when restricted to L. 
However, the space of quadratic polynomials restricted to L that vanish on 


pnb={{ i, 1,1), 5(-3+ V5,—18 + 85,47 21v3)} 


is two-dimensional. It is spanned by g; and 


160a? — 153ary — 21a + 20y? + 21y — 27. 


Hence, D is not arithmetically Cohen-Macaulay since there is a two-dimensional 
set of quadratic polynomials restricted to L vanishing on DM LD while only a one- 
dimensional set of quadratic polynomials vanishing on D. <- 


EXAMPLE 2.28. Returning to the border rank of 2 x 2 matrix multiplication 
from Ex. [2.20] one can first investigate polynomials vanishing on o¢ by looking for 
polynomials which vanish on a witness point set og L and are restricted to L. For 
example, this shows that no nontrivial homogeneous polynomials of degree < 18 
vanish on og ML and hence the same is true for og. For comparison, the space of 
homogeneous polynomials of degree 18 in C®* has dimension 


(“ +18 


18 ) = 456,703 ,981,505,085,600 


while the corresponding computation on og 7 L restricted to L only has dimension 


ex 


= 7,315. 
ie 7,315 


In degree 19, there is a 64-dimensional space of polynomials vanishing on og L. 
Representation theory was used in to describe the corresponding polynomials 
vanishing on og. To complete the story, an explicit polynomial vanishing on og was 
then constructed which did not vanish at the 2 x 2 matrix multiplication tensor T 
confirming that T ¢ o¢ thereby showing that the border rank of T is 7. << 


Sums of Squares Modulo an Ideal. A polynomial p € R[z1,...,%»] is a sum of 
squares if there are gi,...,g¢ € R[x1,..., 2%] such that p = g? +--- +7. Clearly, 
if p is a sum of squares, then p > 0 on R”. In addition to R”, one can consider 
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certificates of nonnegative on other real varieties. For a variety V C C”, p is a sum 

of squares modulo the ideal I(V) if there are g1,...,g¢ € R[a1,...,@n] such that 
p=gi+---+g? modI(V) 

which certifies p > 0 on VOR”. 

If a generating set of I(V) is known, then sums of squares modulo I(V) where 
the degrees of the polynomials g; are bounded can be determined using semidefinite 
programming (SDP). Rather than using defining polynomials of V, an alternative 
approach described in is to utilize sample points. If aj,...,a@m € V, a necessary 
condition for a polynomial p to be a sum of squares modulo I(V) is that there exists 
gi,---,ge € R[x1,...,@n] such that p(a;) = gi(a;)? +---+ge(a;)? for j =1,...,m. 
This necessary condition where the degrees of g; are bounded can also be solved 
via an SDP. The solution of this necessary condition can be checked to see if the 
purported decomposition is correct via testing at one extra general point in V. If 
the test fails, one adds additional sample points and solves again. A finite number 
of general points on V suffice to either compute a decomposition if it exists or show 
that no such decomposition exists when the degrees of g; are bounded. 


EXAMPLE 2.29. To illustrate the advantage of using sample points, we utilize 
a modification of the trace ratio problem described in $7.4]. Define 


Ame ={e eRe | x =X? =X, trace x) =e}. 
For symmetric matrices A, B,C € R™*™ where B is positive definite, compute 
_ trace(AX) 
~ X€Xm,e trace(BX) 


* 


7 + trace(CX). 


For p(X, 7) = trace(BX)(7 — trace(CX)) — trace(AX), an equivalent problem is 
y* =min y such that p(-,y) >0 on Xn ¢ 
vyeER , 


which is relaxed to 


03 = min y such that p(-,7) is a sum of squares mod I(%m,c). 
VE 


When searching for a sum of squares decomposition with deg g; < 1, the following 
from Table 2] shows the improvement of using sampling over defining equations: 


Equations SDP Sampling SDP 

m J variables constraints time(s) | variables constraints time(s) 

4 2 342 188 0.47 56 45 0.10 

5. 3 897 393 0.71 121 105 0.11 

6 4 2062 738 1.34 232 210 0.15 

7 5 4265 1277 3.62 407 378 0.19 

8 6 8106 2073 9.06 667 630 0.34 

9 7] 143887 3198 23.83 1036 990 0.61 

10 8] 24142 4733 58.17 1541 1485 118 & 
Note that the sample points a1,...,@m, used above were selected in V and not 


necessarily in VM R”. In other situations where required, the following describes 
an approach for sampling real points on a variety. 


Sampling Real Points. In 1954, Abraham Seidenberg stated what he 
called a “well-known” fact that if a variety contains real points, then there is a real 
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point on the variety of minimal Euclidean distance to the origin. This is an easy 
consequence of the extreme value theorem. Moreover, each connected component 
of the set of real points has such a minimal point thereby providing the basis for 
a method to compute a point on each connected component of the real points of a 
variety. 


EXAMPLE 2.30. The cubic plane curve defined by 
f(x,y) = 3a? + 24x7y — 6a? + 482y? — 24ry+1=0 


has three connected components where the minimizers of Euclidean distance to the 
origin on each connected component are shown in the following picture: 


<p 


One shortcoming of measuring distance to the origin is that the correspond- 
ing minimizers need not be unique. For example, every point on the unit circle 
x? +y? =1 is equidistant to the origin. However, minimizing distance to a ran- 
domly selected point will yield a unique minimizer on each connected component. 
Thus, for a variety V C C” and a randomly selected p € R”, one aims to solve 
(2.92) min{||z — p||5 | 2 € VN R”} 


where ||x — p||3 = (v1 — pi)? +--+ + (@n — pn)”. Similar to Section [2.2] one replaces 
solving (2.32) with computing the critical points of (2.32) via solving a polynomial 
system, namely computing x € V such that 


(2.33) rank{ c—p Jacy(x) | <n—dimV 
where Jacy (x) is the Jacobian matrix of V at a. 


EXAMPLE 2.31. The critical points for Example. [2.30] when p = 0 satisfy 


of 
ae of 
f(z,y)=0 and rank a | i 
Yo By 
Enforcing the rank condition on the matrix by setting its determinant to zero yields 
3x3 + 24x7y — 6x? + 48xy? — 24ry +1 hy 
24x + 87x?y — 24x? — 48ary” + 12ry — 48y? + 24y? | 
This system has 7 solutions: three real minimizers plotted above in Example 2.30] 
which are approximately 


(0.3408, 0.1277), (—0.2164,0.7026), (—0.1641,—0.1441) 


and four nonreal solutions which are approximately 


(0.1971 + 0.2100\/—1, 0.3842 + 0.0912\/—1), (0.0483 + 0.1719.\/—1, —0.0336 + 0.2498\/—1). <> 
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A drawback of (2.33) is that every singular point of V is considered a critical 
point. In [299], the rank condition (2.33) is enforced using determinants and an 
infinitesimal is used to perturb the system thereby yielding a finite subset of the 
critical points containing the global minimizer on each connected component. An 
alternative approach presented in uses a null space, which is equivalent to 
the Fritz John conditions, to enforce (2.33) together with a homotopy in order to 
compute a finite subset of the critical points containing the global minimizer on each 
connected component. The following illustrates this homotopy-based approach. 


EXAMPLE 2.32. Consider the hypersurface V C C3 defined by 
f(x) = (#2 — xf) + (1a2 — 23)” + (a1v3 — £3)? = 0 
for which V MR? is the real part of the twisted cubic curve. Let p = (—1, 2,2). 


Every point in V MR? is a singular point on V and hence a critical point of (2.32). 
For, say, € = 1+ /-—1, we can compute a finite set of critical points via 


a f(a) — et _ 
H(a,d;t) = [a—p Vf(2) JA =0 


where A € P!. Solving at t = 1 yields 32 start points for the homotopy H. All 32 
solution paths converge as t — 0 yielding: 

e (1 path each) 10 nonreal points which are smooth points on V 

e (2 paths each) 3 real and 2 nonreal points which are singular points on V 

e (3 paths each) 4 nonreal points which are singular points on V. 
A complete accounting of the paths is 32 = 10-1+5-2+44-3. The 3 real points 
listed in increasing distance from p are approximately 

(1.2138, 1.4734, 1.7884), (—0.5969, 0.3563, —0.2127), (0.2335, 0.0545, 0.0127) 


which are the global minimum, a non-global local minimum, and a non-global local 
maximum, respectively, as observed in the following picture: 


APE eeneeeee 


Flan 
PALIT \ 


<> 


It is easy to check that the selected choices of p and ¢ in Ex. [2.32] were generic 
in that one would obtain the same number of start points (32) and a similar ac- 
counting of the endpoints (832 = 10-1+5-2-+ 4-3) for generic choices of p and 
e. The term Euclidean distance degree was coined in to describe the generic 
number of critical points of the distance function for a variety. For example, the 
Euclidean distance degree of the smooth quartic hypersurface defined by f(x) = ¢€ 
from Ex. 2.32]is 32. 
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By treating the point p as a parameter, one can utilize a parameter homotopy 
to efficiently solve many different instances of (2.32). 


EXAMPLE 2.33. Consider sampling real points from $ C C? defined by 
f(x,y, z) = 5(4a? + Oy? 4+ 42? — 4)3 — 423 (802? + Oy?) = 0. 

This surface is called stiss which means “sweet” in German since the real part 
S'MR° resembles the boundary of a candy heart. For generic € and p, the Euclidean 
distance degree of the smooth surface defined by f = € is 96 which homotop to the 
following breakdown of critical points on S as € > 0: 

e (1 path each) 34 smooth points on S 

e (3 paths each) 2 singular points (0,0,1) € S 

e (6 paths each) 4 points on the singular curve around the “waist” of S which 

is defined by 

(2.34) Ag? + 9y? -4=z=0 

e (8 paths each) 4 singular points (+,/—1/19, +.\/80/171,0) € 
with 96 = 34-1+2-3+4-6+4-8. The irreducible components of the singular 
set of S are the singular curve defined by (2.34) and the isolated points (0,0, +1). 
The other 4 singular points are embedded points on the singular curve. 


The following picture plots the real critical points of the distance function to 
200 randomly selected points p € [—2, 2]? obtained via parameter homotopies: 


<b 


Rather than use random choices of p which clusters sample points in high 
curvature regions, one can be more clever in the choice of test points to create a 
provably dense sampling of any compact subset of a real variety. This can be useful, 
for example, in topological data analysis {135}. 


Final Comments. The purpose of this section was to provide a sampling, with 
many omissions, of applications of computing and using sample points in numerical 
algebraic geometry. One common theme of these applications is the extraction of 
information about a variety from some particular set of points on the variety. 


CHAPTER 3 


Geometric Modeling 


The interactions between geometric modeling, algebraic geometry, and commu- 
tative algebra are rich and varied. The first two sections of the chapter discuss the 
geometry and the algebra of geometric modeling. In the third section, Hal Schenck 
goes more deeply into the algebra. 


3.1. Geometry of Modeling 


Geometric modeling involves curves in R? and R° and surfaces in R?. We will 
begin with curves and then move to the richer topic of surfaces. 


Bézier Curves. As computer-aided design developed in the 1950s and 1960s, 
Paul de Casteljau of Citroén and Pierre Bézier of Renault independently created 
and implemented systems based on what we now call Bézier curves[| These curves 
use blending functions Crd —t)"~* and control points Po,..., Py € R* to give 
the parametrization 


® : [0,1] — R® 
i=0 


A nice introduction to Bézier curves can be found in Chapter 2]. 
Here are some Bézier cubics that illustrate the convex hull property: 


VB 


In general, S79 (")t*(1 — t)"~* = 1 implies that ®(t) is a convex combination of 
Po,..., Py and 0 Gh i in a convex hull nena mre 20 a 
Tangent Lines and Curvature. A designer selects control points Pp,...,P, to 


create a Bézier curve. A nice feature of these curves is that it is easy to understand 
the tangent vectors and curvature of Bézier curve at the endpoints Py and P,. 

For simplicity, we focus on Py, where t = 0. An easy calculation using 
shows that the tangent vector at Po is 


©’ (0) _ n( Py = Po). 


1De Casteljau was first but not allowed to publish until six years after Bézier’s 1968 paper. 
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When n = 3, we get a Bézier cubic. Here is a picture: 
Ps 


ee 


This curve has control points Po, P,, P2, P3 linked by line segments. The tangent 
vector ®'(0) = 3(P; — Po) is shown in gray. 
The dotted lines in the picture relate to curvature. Let 
a = distance from P; to Py = ||P; — Pol 
<a 

h = distance from Py» to the line PoP. 
Then one can prove that the curvature at Po is given by the formula 
n—-lh 


n az 


(3.3) K(0) = 


The Bézier cubic (8.2) has a lot of curvature at Po. This is obvious from the picture 
and also from the formula since a is much smaller than h. 

One nice feature of these formulas is that when t = 0, we have: 

e The position is determined by Po. 

e The tangent vector is determined by Po, P,. 

e The curvature is determined by Po, P,, P2. 
The position of the other control points has no effect on these quantitites. 

There are analogous formulas for the tangent vector and curvature when ¢ = 1 
that depend only on P,—2, Pn—1, Pn- 


Visualizing Curvature. In differential geometry, curvature is often introduced 
via the osculating circle, which is the circle that best approximates the curve at a 
given point. Its radius is the reciprocal of the curvature and its center is the center 
of curvature. If N(t) and «(t) are the unit normal and curvature at ®(¢), it follows 
that the center of curvature is 


(3.4) b(t) + —-N(t). 
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For a Bézier cubic, this is illustrated as follows: 
P3 


a 


osculating 
circle 


1/K(t) 


center of 


Po curvature 


For a single point, this works nicely, but for multiple points, the osculating 
circles create a cluttered picture. It is simpler to focus on the centers of curvature, 
the curve of centers of curvature. However, its distance to original curve is the 
reciprocal of the curvature, making it harder to see how the curvature varies. 

In geometric modeling, it is customary to replace the center of curvature (8.4) 
with the point 

@(t) — K(t)N(t). 
Thus we work on the other side of the curve and consider points whose distance to 
the curve is the curvature rather than its reciprocal. The resulting picture is the 
curvature comb: 


A web search of “AutoDesk Alias curvature comb” will lead to some nice images. 


The de Casteljau Algorithm. The curvature comb is nice but requires knowing 
the curvature at many points. Right now, we only know the curvature at the 
endpoints. The de Casteljau algorithm gives us access to more points on a Bézier 
curve, as we illustrate for a Bézier cubic with control points Po, P,, P2, Ps: 

e Take successive midpoints to get Po1, Pi2, Pas. 

e Take successive midpoints of Po1, Piz, P23 to get Poi2, Pi23. 

e Take the midpoint of Pou, Py23 to get Po123. 
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These successive midpoints are shown on the left of the following figure: 


Po 


On the right, we have drawn the Bézier cubic for the control points Po, P,, Po, Ps. 
The magic is that this curve breaks up into two Bézier cubics: 


e The left half is the Bézier cubic with control points Po, Po1, Po12, Po123- 
e The right half is the Bézier cubic with control points Po123, P23, P23, P3. 


Since we know the curvature at the endpoints of a Bézier curve, we now know the 
curvature at Po123. Iterating this process, we can find the curvature at many more 
points on the curve. Hence the curvature comb becomes easy to compute. 


Two Variations on Bézier Curves. The Bézier curve (8.1) is defined on the 
interval [0,1] and features the binomial coefficients ("). Both of these can be 
generalized: 


e Change the interval. We can switch the interval from [0,1] to [a, }] via 


n 


1 n ; : 
®,, 4(t) = ——— t—a)'(b—t)”"'P,, te [a,d). 
en = Gage ds ({)e-O-9 € [a0 
The convex hull property holds since )>7_, (")(¢— a)'(b — t)"“* = (b-a)”. 


e Use general weights. Instead of (ep use positive weights w;. This gives 


1 a ; 
®,,(t) = : 0—t)?'*P, +t€ (0,1). 
(t) PS we 8) (0,1 


0 Wi 
The denominator guarantees that the convex hull property holds. For most 
weights, this is a rational parametrization. The denominator never vanishes 
since t € [0, 1] and the weights are positive. 

It is also possible to combine these two variations. Here is an example that gives a 

preview of the connection to toric geometry. 


EXAMPLE 3.1. Consider the interval [0,7]: 


0 1 2 3. «4 5 ae n-1l n 
This is a one-dimensional lattice polytope with lattice points 0,...,n and facets 
0,n. For positive weights wo,...,w, and control points Po,..., Pn, we get the 


weighted Bézier curve 
1 


B(t) = ys wt! (n = t) 


— > witi(n-t)""P;,  t € [0,n]. 
1=0 
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The blending functions t'(n — t)"~* are indexed by the lattice points i = 0,...,n 
and have two interesting properties: 
e For t € [0,n], the functions t and n — t give the distance from t to the facets 
0, n of [0, nl], 
e For each index 7, the exponents 7 and n —i give the distance from the lattice 
point 7 € [0,n] to the facets 0,n. 


Later in the section we will explain how this interpretation of t’(n — t)"~' can be 
generalized to define blending functions indexed by the lattice points of an arbitrary 
lattice polytope. <— 


Font Design. There are many ways to design fonts such as Times Roman or 
Helvetica. In an outline font, one gives information that specifies the outline of 
each letter. Common examples include: 

e TrueType fonts, which use quadratic Bézier curves. 

e Type 1 PostScript fonts, which use cubic Bézier curves. 


EXAMPLE 3.2. In the TrueType font Monotype Arial, the letter “b” is built 
from line segments and quadratic Bézier curves of the form 
&(t) = (1 — t)?(ao, bo) + 2t(1 — t)(ai, b1) + t7(a2,b2), +t € [0,1]. 
This curve goes through 0 = (ao, bg), 0 = (a2, b2), and O = (a1,b,) “pulls” the 
curve. In this notation, a Bézier quadratic has control points O00, 
Now consider the whole letter: 


The line segments are indicted by consecutive L’s, and the rest of the outline is 
composed of Bézier quadratics. For example, the points marked 1, 2, 3 in (3.5) are 
the control points 0,0,U that define one of the Bézier quadratics. 

The consecutive ©’s in the picture seem to indicate missing control points. 
When this happens, the control point is not shown because it is the midpoint of the 
the line segment connecting the consecutive O’s. This midpoint convention allows 
the outline of the letter to be specified more efficiently. <P 
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Bézier Surfaces. We begin with the most common Bézier surface, the tensor 
product surface of degree (m,n). Given control points Pj; € R“, 0 < i < m, 
0<j <n, we get the parametrization 


(3.6) (3,2) = >> @ C si(1—s)™-*49(1-4)"-9P,,, (s,t) € [0, 1] x (0, J. 


i=0 j=0 
The image is called a surface patch since it is the portion of the surface parametrized 
by the unit square. Note that the convex hull property holds since the coefficients 


in @.6) sum to 1. 


Here is a splendid example of a bicubic Bézier patch (degree (3,3)) taken from 
the paper [160}: 


(3.7) 


Note that the edges of this patch are Bézier cubics. Thirteen of the 16 control 
points P;;, 0 < i,j < 3, appear in @.7). The red lines will be explained later in 
the section. This well-known image appears on the cover of the SIAM Journal on 
Applied Algebra and Geometry shown on page [95] 

Bicubic patches are widely used to design automobiles, airplanes, and buildings. 
A nice example is the Guggenheim Bilbao: 


This building was designed in CATIA, a program created by the French aerospace 
company Dassault Systems. Each titanium sheet is a Bézier patch. The book 
Digital Gehry describes the process used to create this amazing structure. 
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SIAM Journal on 


Applied 
Algebra and 
Geometry 
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We can rescale the formula (8.6) to write a Bézier patch of degree (m,n) as 


1 m\(n\ ; es : 
3.8 B(s,t) = ——— i(m—s)™—*#) (n—t)"-5 P,, 
(3.8) (s,t) wos (TGF) synteny Py 
for (s,t) in the rectangle P = [0,m] x [0,n]. The denominator m’n” is needed for 
the convex hull property. 


The sum in (8.8) is over the lattice points PM Z? of P: 
(0, n) (m,n) 


(0, 0) (m, 0) 


For each lattice point (i,7), the blending function s‘(m—s)"™~‘t)(n—t)”~ is built 
from the distance to the facets of P. We will soon generalize this to an arbitrary 
lattice polytope. 


Triangular Bézier Patch of Degree n. Let Ay = Conv(0, e1, e2) be the standard 
simplex in R?. Given n > 0 and control points P;; € R’,i,j>0,i+j7 <n, we 
have the triangular Bézier patch 


1 n! <3 _ 
— C47 e—t—J bs 
(3.9) B(s,t) = — 2s Wns ti(n—s—t) P; 
i+j<n 
for (s,t) in the triangle P = nAg. As above, the n” is needed for the convex hull 
property. The sum is again over the lattice points of P: 


(0,7) 


(0, 0) (n, 0) 


The blending function s‘t/(n —s—t)"~*~/ indexed by (i,j) is built out of distances 
from (s,t) and (i,7) to the facets, but for the facet opposite the origin, one needs 
to use the notion of lattice distance, to be defined in (8.10) below. 


Weighted Surface Patches. The weighted versions of tensor product (3.6) and 


triangular (3.9) surface patches replace ales) and aT Cel with positive weights 
w;;. For example, the weighted patch defined on [0,m] x [0,7] is 


1 - an - 
Yi Wigs? (M—s)™—*t (n—-t)"—J de wigs! (m—s)™ ey Es 
aj aj 


®(s,t) = 


The denominator ensures the convex hull property and doesn’t vanish on P since the 
weights are positive. Similar to the curve case, weighted surface parametrizations 
typically involve rational functions rather than polynomials. 
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Lattice Polytopes. In 2002, Rimvydas Krasauskas found a way to generalize 
the Bézier surface patches (8.6) and (8.9) to an arbitrary lattice polytope P C R4 
of dimension d. 

The polytope P is the convex hull of a finite set A C Z%, though for us, the 
facet presentation of P is equally important. A facet F' C P is a codimension-one 
face whose supporting hyperplane is defined by 


(m,n) = —ap, —apr €Z, np EZ”. 


We specify this equation uniquely by requiring that nr € Z” be primitive (meaning 
ne ¢ mZ” for all integers m > 2) and inward-pointing. Then the desired facet 
presentation is 


P={me€R*| (m,nr) > —ar for all facets F}. 
Lattice Distance. For a point m € P, the quantity 
(3.10) hr(m) = (m,nr) +ar > 0 


measures the number of parallel integral hyperplanes one must cross in order to get 
to the facet F. Here is an example of what this looks like in R?: 


) \ Yy oy 
“o& 4 
ee ee ee: 
e : e@ : oe ; e@ 


hr(m) = 2.5 


We call he(m) the lattice distance from m to F’. Note that hr(m) need not be the 
Euclidean distance. In this notation, the facet presentation becomes 


P={me€ER"|hr(m) = 0 for all facets F}. 


Toric Bézier Functions. Fix a finite set A C Z¢ and let P = Conv(A). For 
a € Aand m€ P, define the toric Bézier function 


(3.11) AS Gn) =| hein), 
F 


where the product is over all facets F of P. The exponents are nonnegative since 
a € P and £"*(m) > 0 since m € P. Here are some familiar examples: 


e P = {0,n] gives the functions t'(n —t)"~* used in the Bézier curve defined in 
Example 

e P= [0,m] x [0,n] gives the functions s‘(m — s)™~*t?(n — t)"~J used in the 
tensor product Bézier patch (8.8). 

e P =ndAzg gives the functions s*t/(n—s—t)"~* used in the triangular Bézier 


patch (3.9). 
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Toric Patches. The Bézier formulas (8.1) and (8.8) use binomial coefficients 
and (8.9) uses the trinomial coefficient ay Since these do not have an 
obvious toric generalization (more on this below), a toric patch uses positive weights 
w = {wa}aca and control points a laca C R* to define ®4,, : P > R® by 


5 Waser (m meP. 


Wea w tm ery 


A common choice for A is the set of all lattice points in P. This is what happens 
for Bézier curves and surfaces. Here is an example where we use fewer. 

EXAMPLE 3.3. Let A= {a; = (0,0),a9 = (1,0), a3 = (1,1), a, = (0,2)} C Z?. 
Notice that A does not use all lattice points in P = Conv(A). 


(3.12) DA w(m) = 


polygon: control points: 
a = (0,0, 0) 
2—s—-t=0 = (.5, 1,0) 

a3 a = (0, 2,0) 

s=0 1—s=0 P, = (0, 1,2) 
af;—o 2 weights: 1,1,1,1 

toric Bézier functions: 

Bere(s t) = s°t9(1—s)1(2—s-t)? 

Bier*(s,t) = s14(1 — 5)°(2— 5-4)! 

Bie"*(s,t) = st1(1 — 9)°(2— s - #)° 

Bi"*(s,t) = 9#(1 — 9)!(2—s- 4) 


<< 


EXAMPLE 3.4. Here is an example from that uses all lattice points: 


oO 


The “bulge” in the surface is governed by the control point indicated here. <> 


Degenerations of Toric Patches. By letting the weights go to oo, we can 
degenerate a toric patch into simpler patches. To illustrate how this works, we 
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begin with curves. A Bézier curve with standard weights w; = ('/) and control 
points Po,...,P, goes through Po and P,, and is “pulled” toward P),..., Pn—1. 


EXAMPLE 3.5. Here is a Bézier quartic with standard weights 1, 4,6, 4, 1: 
Bs 


Pi 


Po 
Py 
P2 


We can increase the “pull” of P,, P2, P; by varying the weights. But there are many 
ways to do this. If we use the weights 


(3.13) 1,4w%, 6w®, 4w7, 1, 
then we get the weighted Bézier quartic 


(1—t)* Po + 4w%t(1—t)? P, + 6w t?(1—t)? Po + 4w7#3(1—t) Ps + t4* Py 
(1—t)4 + 4wet(1—t)? + 6w8t?(1—-t)? + 4w7t3(1-t) + t4 


&(t) = 


defined for t € [0,1]. When we let w — oo, the limit curve that depends on the 
exponents a, 3,y in (8.13). Each picture (a), (b), (c) below shows the original 
Bézier quartic in black and two weighted versions, one in light blue with bigger 
weights and one in dark blue with much bigger weights: 


ARAAAR 


a) weights: b) weights: c) weights: 
ee »4w3, 1 Pere ans 
exponent plot: exponent plot: exponent plot: 


ATN AG 


The exponent plot is explained as follows: 


e The exponents of w in (8.13) are 0, a, 3,7, 0. 
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e The exponent plot consists of the points (0,0), (1, a), (2,8), (3, y), (4,0). 
e The upper convex hull of the exponent plot is outlined in red. 


The limit curve as w — oo is determined by the upper convex hull. The limit curve 
has four line segments in (a) and three in (b). In (c), the limit consists of two line 
segments and a weighted Bézier quadratic since the top of the upper convex hull is 
a copy of [0,2]. You can see the limiting behavior in the above pictures. <— 


EXAMPLE 3.6. We now explain the red lines in bicubic surface patch (8.7). For 
(n,m) = (3,3), the products of binomial coefficients in (3.7) form a square: 


Fw wir 
wow Oo Oo w 
wow Oo Oo w 
Fw wir 


The idea is to introduce weights depending on a parameter w as in Example[B.5)and 
use the upper convex hull of the exponent plot to understand the limit as w > oo. 
We will consider the weights used in : 


upper convex hull 
of the exponent plot 


{ draw red lines 


—> 


project 


regular polyhedral 


decomposition 
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The two weights shown in red do not contribute to the upper convex hull and, 
as indicated by the open circles, are not part of the regular polyhedral decompo- 
sition of P = [0,3] x [0,3] induced by the upper convex hull. The red lines in 
the decomposition of P become the red lines we originally saw in (8.7). This also 
explains why we see only 13 of the 16 control points: one missing point is hidden 
behind the surface, and the other two come from the open circles not used in the 
decomposition. 

Furthermore, as shown in [160], letting w — oo leads to a limiting configuration 
supported on the red lines: 


The limit consists of 9 surface patches: 


e 3 tensor-product Bézier patches. 
e 4 triangular Bézier patches. 
e 2 toric patches (shaded) that are not Bézier! 


This example shows how toric patches can arise in a purely Bézier setting. <— 


The full story of Examples B.5] and B.6]is explained in Theorem 3 of : 


Bernstein Polynomials. The polynomials (") t’(1—t)"~* appearing in the Bézier 
curve (8.1) where introduced in 1912 by Sergei Bernstein in his elegant proof of the 
Weierstrass Approximation Theorem [26]. To approximate a continuous function 
F': [0,1] > R by a polynomial of degree n, Bernstein took a probabilistic approach. 
A player flips a biased coin where the probability of heads is t and tails is 1 — t, for 
some t € [0,1]. Bernstein assumes the player flips the coin n times and is paid the 
sum F'(+) if heads occurs exactly i times. Under these conditions, the expectation 


of the player is 
(3.14) Fn= > F()Mea-0"*. 
i=0 


Bernstein proves that E,, + F uniformly for t € [0, 1]. 

The Bernstein polynomials (’) t’(1—t)"~* are well known in numerical analysis 
and have many nice properties, as explained in Theorem 7.3.1]. A modern 
version of Bernstein’s result can be found in Theorem 5.3.2]. 

Bernstein’s proof of the approximation theorem extends easily to continuous 
functions F : [0,1] — R*. Here, the graph of F is approximated by the Bézier 
curve ®(t) = S7jy ({) t'(1 — t)"-*P;, where the i" control point P; is chosen to 
be the point F (+) on the curve. This explains why Bernstein polynomials appear 


naturally in Bézier curves. In approximation theory, n is usually large. The insight 
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of de Casteljau and Bézier is that in the geometric setting, small values of n lead 
to useful curves. 


Linear Precision and Weights. One nice feature of Bernstein polynomials is 
that for an affine linear function F : [0,1] > R°, the approximation is exact, i.e., 
n 
F(z) => (ti - 2)" *F (4). 
i=0 
This property is called linear precision. 
For Bézier curves, linear precision helps explain the role of the weights w; > 0 
that appear in a weighted Bézier curve 


P(t iti(d a tp. 
() = 0 ue De 


The convex hull property implies that if the P, are collinear, then ® parametrizes a 
line segment, though the parametrization usually has degree n. However, if w; = (7) 
and Po,...,P, are collinear and equally spaced, then linear precision implies that 
® is affine linear. A lot of cancellation is needed for the degree to drop from n to 1. 
Binomial coefficients are indeed special. 


Linear Precision for General Patches. Fix a finite set A C Z% and assume 
that P = Conv(A) C R@ has dimension d. A general patch has two ingredients: 

e Continuous blending functions Ba : P > R>o, a€A, with yacy Ba =1. 

e Control points P, € R¢ for a € A. 
Then the patch is the function ® : P —> R° defined by 
(3.15) =S Binh, meP. 

acA 

Note that general patches satisfy the convex hull property. 


EXAMPLE 3.7. A toric patch (3.12) can be written 


Baul) = > Ass tom 


acA 


where the toric blending function Ztre is the rational function 


a,w 
m : w toric m 
Bae (m) = sea ( ) 


aed UN oles (m) 
and 6%"* is the toric Bézier function defined in (3.11). < 


EXAMPLE 3.8. For blending functions £8 Fue A as above, the canonical choice 
P, =a of control points gives the tautological patch defined by 


o"'(m) = S~ Ba(m)a 
acA 
Note that ®**"*(m) € P by the convex hull property. Thus ®*"* : P > P. < 
For a general patch (8.15), one can prove that the following are equivalent: 
e For any affine linear function A : R4 > R*, we have 


= 5° Balm) A(a), me P. 


acA 
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em = Vaca Ba(m)a for all m € P. In other words, the tautological map 
p"* ; P + P is the identity. 
Patches satisfying these conditions have linear precision. The paper studies 
linear precision in geometric modeling. 


Toric Patches and Strict Linear Precision. Bézier curves (8-1), tensor-product 
patches (3.6), and triangular patches (8.9) have linear precision. These generalize 
to Bézier simploids, which are toric patches built from products of simplices with 
coefficients given by products of multinomial coefficients. See Example 3.8] 
for a precise definition. It is well known that Bézier simploids have linear precision. 


DEFINITION 3.9. A finite set A C Z” has strict linear precision for positive 
weights w = {wa}ac if the toric patch ®,4 ,, (8-12) has linear precision, i-e., if 


m= . J wa8e"*(m)a = S> Byese(m)a 


Daca Waha(m) 24 ac 


for all m € P. 


It is likely that Bézier simploids are the only toric patches with strict linear 
precision. Here is recent result from [99]. 


THEOREM 3.10. Let A be the set of lattice points in a d-dimensional lattice 
polytope P C R¢%. Then A has strict linear precision for positive weights w = 
{wa}aca if and only if 

(1) The sum of the facet normals ng of P is equal to zero. 

(2) The polynomial onc, Waban” is constant. 


The polynomial in (2) is the denominator of the toric patch ®4.,. It follows 
that toric patches with strict linear precision are polynomial parametrizations. 

The proof of Theorem. 10]has a surprise—it uses ideas from algebraic statistics, 
including the concept of maximum likelihood degree from Section 


Algebraic Statistics. The toric blending functions eee defined in Example B.7] 
give a map 


P—+R¢ 
(3.16) a 


w5(™)) aca 

whose image is denoted Y4,~. Points of Y4,., have non-negative coordinates that 
sum to 1, so that Y4.~ C R&, is a statistical model. Example 3.5 of implies 
that Y4.~ is a what statisticians call a log-linear model, though these days the term 
toric model is also used. 

In Chapter 2] we encountered the notions of maximum likelihood estimate and 
maximum likelihood degree from algebraic statistics. The phylogenetic model (2.25) 
discussed in Example [2.17] has ML degree equal to 92. But in some special cases, 
a statistical model has ML degree one, which is nice because this implies that the 
ML estimate is a rational function of the data. 

In 2014 June Huh proved a powerful result about varieties in (C*)” of 
ML degree one. This is relevant because, as we explain below, strict linear precision 
implies that Y4, has ML degree one. Huh’s result plays a central role in the proof 
of Theorem B.10] given in Theorem 7.2]. 


m+ ( 
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At first glance, the relation between algebraic statistics and geometric modeling 
seems unexpected. But keep in mind that statistical thinking inspired Bernstein in 
1912 to define the polynomials that appear so naturally in geometric modeling. 

We now explore one further aspect of this connection. 


Toric Patches and Rational Linear Precision. The definition of strict linear 
precision is very restrictive since it uses the toric blending functions 6Z°%, from 
Example Following [159], we say A has rational linear precision for positive 
weights w = {Wa}aca if there are rational blending functions 6, : P > Rso for 
a CA satsifying ))¢ 4 ba = 1 with the following two properties: 
e The Ba have linear precision, i.e., for all m € P, we have 
m= S- Ba(m)a. 
acA 
e The image of the map 
mr+> (Ba(m)) se 4 
is the statistical model Y4.. defined in (8.16). 


In other words, rational linear precision for A,w means that we can parametrize 
Yaw by rational blending functions that have linear precision. 

Note that strict linear precision implies rational linear precision since the toric 
blending functions satisfy the above condition in the strict case. The converse fails, 
as shown by the following example. 


EXAMPLE 3.11. Consider the trapezoid 


ag as 

Fy F3 

at a3 
a2 


with.A= 4a; = (0; 0),a¢= (1, 0), a5 = (0,0), a4 = (0,1),a¢ = 1) CS’, 

Let h; denote the lattice distance to the facet F;. For weights w = (1,2, 1,1, 1) 
and control points P,,..., Ps, we get the toric patch 

1 
, ( 
(2—t)(2—2t—t?) 

Using Theorem [3.10] we see that strict linear precision fails since the facet normals 
do not sum to zero. Also, the denominator in the above formula is nonconstant. 

However, one can show that for the same weights w = (1,2,1,1,1), A has 
rational linear precision with the blending functions 


(3.17) ms 


hghaP,+2hyhghaP2+hjhaP3+hgh3Ps+hih2Ps). 


hha 2hyhsha hha hgh3 hyhe 
3.18 r> 3 + +. P. Py Ps. 
Ae). ME pe Gage Gate gap aoe 
Formulas (3.17) and (8.18) are similar, but there is no theory that explains their 
relation. It is also unknown which polytopes have rational linear precision. <— 


We close with the lovely connection between rational linear precision and ML 
degree one proved in Proposition 4.1]. 


THEOREM 3.12. A has rational linear precision for w if and only if Yaw has 
ML degree one, and when this happens, the ML estimate gives a parametrization 
with linear precision. 


3.2. ALGEBRA OF MODELING 105 


There is a lot more to say about this subject, including many open questions. 
But for lack of space, we must stop here. You can consult the papers and 
and the references therein for further reading. 


Final Comments. In spite of many missing details, we hope to have convinced 
you that geometric modeling has compelling pictures, wonderful applications, and 
an unexpected connection to algebraic statistics. 

It is now time for the algebra of geometric modeling to take center stage. 


3.2. Algebra of Modeling 


We begin with curves. In the previous section, a weighted Bézier curve in the 
plane was written 


1 ie 
te [0,lJeo a, City eer 
[ ] eae wti(1 = t)r-t » me ( ) 
for control points Po,...,P, € R*. The algebraic structure of this situation is 
best revealed by working projectively over the algebraically closed field C. Since 
P; = (a;, B;) € R? gives (a;, B;,1) € P2 C P2 = P?, we can homogenize to obtain 


(s,t) € Pi +> = (Lmaat (s—t)"-* se (s—t)"* ae (s—t)” “) ep? 


Each entry is a homogeneous vacua of degree n in s,t. 
More generally, a triple of homogeneous polynomials a, b,c € k[s,t] of degree n 
with gcd(a, b,c) = 1 gives a map 
(3.19) ee P* —_ P?, B(s, t) = (a(s,t), b(s,t), e(s,¢)), 
whose image is a curve C C P?. Here, k = Q,R or C. The ged condition guarantees 


that a,b,c never vanish simultaneously on P!, so that ® is defined on all of P'. 
Algebraically, (8.19) gives the homogenous ideal 


(3.20) I = (a,b,c) C k{s, t]. 
Commutative algebra has developed a rich variety of tools to study homogeneous 
ideals in graded rings such as k[s,t]. For the ideal (8.20), we will see that syzygies 


and the Rees algebra relate nicely to concepts discovered by the geometric modeling 
community. 


The Degree Formula. The parametrization involves three types of degree: 
e n = deg(a) = deg(b) = deg(c) is the degree of the polynomials a, b,c € kis, ¢]. 
e deg(C) is the degree of the irreducible homogeneous polynomial F' € k{a, y, 2] 
with C= V(F). 
e deg(®) is the number of points of P! that map via ® to a generic point of C 
when working over C. 


These numbers are related by the following well-known formula. 
THEOREM 3.13. For ®: P! > P? as in (8.19), we have 
n = deg(®) - deg(C). 
EXAMPLE 3.14. ®(s,t) = (s?+t?, 2s”, s?—t?) is generically 2:1 and parametrizes 


the line y = «+z. Thus n = 2, deg(®) = 2 and deg(C) = 1, and Theorem B.13] 
becomes 2 = 2-1. << 
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Implicitization. We call F = 0 the implicit equation of C, and implicitization is 
the process of going from the parametrization ® = (a,b,c) to the implicit equation 
F = 0. This is a problem in elimination theory since eliminating s,t from the 
equations 


x —a(s,t) =y—0(s,t) = z—c(s,t) =0 


gives the implicit equation F(z, y, z) = 0. 

In Section we learned that elimination can be done using resultants or 
Grobner bases. In particular, implicitization via Grobner bases is a wonderful topic. 
Interested readers should consult Chapter 3] for the basic theory and [2] for 
some recent developments. But for curves, the syzygies of the ideal (8.20) give a 
lovely way to think about implicitization that combines algebra and geometry. 


Syzygies. By the Hilbert Syzygy Theorem, the ideal I = (a,b,c) C R= k/s,t] in 
(3.20) has a free resolution 


(3.21) (=+ Ren aj eRGn= js Reap 2a 


with py + pg =n. This follows since a, b,c are relatively prime of degree n. A proof 
can be found in [108]. 


Elements f1,..., fq in a commutative ring R have a syzygy module 


Sya(fis--s fa) = {(hay.-,ha) € R4| 5 eh =o}. 
w=1 


In general, syzygy modules can have a very complicated structure. The exact 
sequence (8.21) tells us that the syzygy module Syz(a, b,c) is free, with generators 
p,q of degrees 


deg(p) = w = min(p1, #2) < max(p1, we) =n — po = deg(q). 


It is customary to call {p,q} a p-basis of the syzygy module. 

Syzygy is an astronomical term that refers to the alignment of three celestial 
bodies in a straight line. In 1853, Sylvester published On a Theory of the Syzygetic 
Relations [341]. This long memoir defines many colorful terms, including the Be- 
zoutians and Bezoutics mentioned in Section [LI] To help the reader, Sylvester 
added a Glossary of New or Unusual Terms, where he defines a syzygetic function 
to be a sum of given polynomials “with arbitrary functional multipliers, which are 
termed the syzygetic multipliers”. He goes on to say 

When a syzygetic function of a given set of functions can be made to vanish, they 

are said to be syzygetically related. 

Thus, what we now call a syzygy on fi,..., fa consists of the syzygetic multipliers 
hy,...,ha that give a syzygetic relation Se hifi = 0. 

The exact sequence (8.21) was proved by Meyer in 1887 [273], who conjectured 
that a similar result should hold for any number of homogeneous generators. Hilbert 
proved the Syzygy Theorem in 1890 [205], though he never used the term “syzygy”. 
Hilbert’s first application of his theorem was a proof of Meyer’s conjecture. 


Moving Lines. A geometric approach to syzygyies was introduced in the 1995 
paper Implicitization of using moving curves and surfaces by Sederberg and 
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Chen [314]. In the affine setting, the parametrization is written 
_ at) — _ bit) 
c(t)’ 7 ety’ 
which we regard as a moving point in the plane. Then a moving line is an equation 
A(t)a + B(t)y + C(t) = 0 that depends polynomially on the parameter t, and the 
moving line follows the parametrization if the moving point lies on the moving line 
for all values of t. After clearing denominators, this gives the equation 
A(t)a(t) + B(t)b(t) + C(t)c(t) = 0, 


which tells us that we are in a syzygy module. This picture translates easily to 
the projective setting, with the result that a moving line Ax + By + Cz = 0 in P? 
follows the parametrization ® = (a,b,c) : P! > P? if and only if 


(A, B,C) € Syz(a, b,c). 
Here, a,b,c, A, B,C € k[s,t] are homogenous polynomials, where a, b, c have degree 


n and A, B,C have degree m for some m, reflecting that fact that Syz(a, b,c) is a 
graded module. 


Intersection of Two Moving Lines. The point of intersection of two moving 
lines is a moving point: 


1 


second 
moving 
line 


(3.22) 


moving 
point 


This picture suggests that the parametrization ® = (a,b,c) should come from 
the intersection of two moving lines in P?. Let’s apply this idea to a p-basis {p,q}, 
which we write as 


Pit + pay + p3z = 0 and qx + qay + 932 = 0. 
Since both go through the point (a,b,c), we get the syzygies 


(3.23) pia + pob+ p3c = 0 and qia + qob + q3c = 0. 


By properties of determinants, we also have the equations 


p1 det is a — po det ie a + p3 det te | ai 


P3 93 P3 93 P2 Q2 
det |?? 2) — go det|/P) 7) + gg det|/?? 7] = 0. 
is qs)" [ps aa} 7 BS" | pe ae 


The determinants have degree + (n — ys) = n, same as a,b,c, so (3.23) implies 
that up to a scalar multiple, the parametrization is given by 


(3.24) a = det Pe z| , b= —det a Al , c= det B A 
3 P3 93 P2 2 
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Thus a,b,c are the 2 x 2 minors (with appropriate signs) of the matrix 


that appears in the exact sequence (3.21). This is the Hilbert-Burch Theorem! 

It is fun to see how the picture reveals the internal structure of the exact 
sequence described by Hilbert-Burch. More interesting is that was 
discovered by the geometric modeling community, which in the early 1990s was 
unaware of free resolutions and syzygy modules. 


Implicitization via Moving Lines. We next present two related implicitization 
methods that use syzygies. 


METHOD 1: MOvING LINES OF DEGREE n—1. A key observation is that there 
are n linearly independent moving lines of degree n — 1. In terms of the syzygy 
module, this translates to the equality 


(3.25) dim Syz(a, b,¢)n—1 =n. 
We will prove this below using a p-basis. Then let 
(A;, B;, C;) © Syz(a, b,e),i1 C RB_4, 1 <i <n, 


be a basis, where as usual R = k{s,t]. This gives moving lines Ajx + Byy + Ciz, 


which we rewrite using the monomials s"~!, s"~?t,..., st”~?, t”~! of degree n — 1: 
n-1 
Ayx+ By t+ Cjiz= > Lig (2, 2) Cen 
j=0 


We assemble the ¢;;(x, y, z) into an n x n matrix 


of linear forms. In 1995, Sederberg and Chen proved the following result. 


THEOREM 3.15. Let F = 0 be the implicit equation of C = ®(P') C P?. Then 
then x n matrix of linear forms M,,_1 defined above satisfies 


F%e(®) — det M,_1. 


We defer the proof until later. Note that the theorem is consistent with the 
degree formula in Theorem [3.13] since det M,,_; clearly has degree n. 

An interesting feature of Theorem [3.15] is that it uses a matrix built from 
syzygies, just as Theorem [1.18] in Section [2] did surface implicitization using a 
matrix also built from syzygies. 


METHOD 2: p-BASES. The matrix M,-1 in Theorem [3.15] uses n linearly 
independent moving lines, yet the picture suggests that we should need only 
two, namely those coming from a p-basis of Syz(a, b,c). This intuition is correct, 
as we now explain. 

Let the pi-basis be p = (pi, p2,p3) and q = (q1,q¢2,q3). By abuse of notation, 
we define the polynomials 


p=pix+ poy + p3z and q = qx + qay + q3z. 


Hence the moving lines are p = 0 and q = 0. 
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As polynomials, p and q are bihomogeneous of degrees pw and n — uu in s,t and 


degree 1 in x,y, z. To eliminate s,¢, we use the Sylvester resultant from Section [L.2] 
Regarding p, qg as homogeneous polynomials in s,¢, the resultant Res(p, q) has degree 


deg, .(p) + deg, (gq) =w+(n—-p) =n 
in x,y,z. In 1998, Cox, Sederberg and Chen proved the following. 


THEOREM 3.16. Let F = 0 be the implicit equation of C = ®(P!) C P?. Then 
a p-basis {p,q} of Syz(a,b,c) satisfies 


F%s(®) — Res(p, q). 


Proor. As we learned in Chapter [I] Res(p,q) vanishes at (x0, yo, zo) € P? if 
and only if there exists (so, to) € P+ such that 


P1(So, to)Xo + p2(So, to) Yo + p3(So, to) 2% = 
91(S0, to)#o + g2(so, to)¥o + 93(S0, to) 2% = 0. 


Arguing as in (8.23) and @.24), these equations tell us that up to a scalar multiple, 


p2(so, to) 
Xo = det 
bees 


q 
q. 
,to) (So, to) 
= — det P1(So, to ? =F t 
a ‘ fas q3(So; to) (80,0) 
q 
q 


to) 
= det P1(S0, to 
" ees 


Thus Res(p, g) vanishes precisely on C and hence is a power of F', where F' = 0 is 
the implicit equation of C. We saw above that Res(p, q) has degree n, and then we 
are done since n = deg(®) - deg(C) by Theorem B.13 


for 


The y-basis {p,q} also relates nicely to METHOD 1. We assume 1 
k Is, ¢], 


m 
simplicity. Since {p,q} is a basis of the graded R-module Syz(a, b,c), R 
we have 


Zs 


Syz(a, b, east = a= pd ® Ry-14- 


It follows that the n syzygies 


(3.26) hg gp gk pee ae td). 2 gt 8 


“3 


form a basis of Syz(a, b, c)n—1 as a vector space over k, proving (8.25). Furthermore, 
if we write 


p= aos" t+aysh—tt +--+ + a,th 


q= bos” # + bys” Ht feee a a 
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where ao,...,@y,00,---,bn—» are linear in x,y,z, then expressing (3.26) in terms 
of the monomials of degree n — 1 in s,t gives 


sr—l-Py ao ay ae ne oo 
=o s 
3? . Mtp ap ay sie Ap or 
n—-l-p ‘ 
pol ae eo _ Oy ghee 1 
Ss qd bo by use Dn—p : 
s—2tq bo by on bn-p : 
: st"? 
< . . . pro 
th-l¢ bo by # jabs: bn—p 


Hence the matrix M,_1 in Theorem [8.15}is the Sylvester matrix for the resultant 
we saw in Section[L.1] It follows that 


det M,,-1 = Res(p,q) = Fees(®) 


which proves Theorem [3.15] for the special basis (8.26) of Syz(a,b,c)n—1. The 
general case follows since changing to an arbitrary basis multiplies det M,_1 by a 
nonzero constant. 


The Rees Algebra. Given an ideal J in a commutative ring R, its Rees algebra 
is the graded R-algebra 


RL) = PB ime™ C Rel. 
m=1 
If l= (fi,..., fa), then x; + fie induces a surjection 


Rin,---,ya] 2 RID C Re]. 


Generators of the kernel K = ker() are called defining equations of R(I). When 
we use the standard grading on R[y1,...,Yya], we see that K is also graded, with 
kk; denoted the elements of K of degree j in y1,...,Ya- 

The Rees algebra of an ideal I = (f1,..., fa) has a nice relation to the syzygy 
module of its generators f;,..., fa: 


e (hy,...,ha) € Syz(fi,..., fa) if and only if hyyi+---+haya € Ki. It follows 
that Syz(f1,..-,fa) 2 K4. 
e Kk, generates K when f;,..., fq form a regular sequence in R. 
e The quotient ring R[yi,...,ya|/(1) is isomorphic to the symmetric algebra 
Symp(J), which will play a role in the next section. 
In what follows, we will focus on the Rees algebra of J = (a,b,c) C R = k{s, ¢]. 
Using the coordinates x, y, z of P?, we write the presentation of R(J) as 


(3.27) 0—> K — Riz,y,2] 6 R(D — 0, 
where 3 maps z, y, z to ae, be, ce € R{e}. 


Moving Curves and the Rees Algebra. The ideal of defining equations K in 
(8.27) was discovered by the geometric modeling community, as we now explain. 
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Let B € k{s,t;x,y, 2] be bthomogeneous of degree i in s,t and degree j in x,y, z, 
written deg(B) = (i,j). Then: 
e For a fixed (s,t) € P!, the equation B(s,t, x,y,z) = 0 defines a curve in P? 
of degree 7. 
e When we vary (s,t) € P', we get a moving curve of degree j in P?. 
In [314], Sederberg and Chen consider moving curves that follow the parametriza- 
tion, meaning 
B(s,t,a(s,t), b(s,t), c(s,t)) = 0. 
Since B is homogeneous of degree j in x,y, Zz, this implies 
0 =e) B(s, t, a(s,t), b(s, t), c(s, t)) = B(s,t, a(s, t)e, b(s, t)e, c(s, t)e) = B(B), 
where ( is from (3.27). Hence B € K;, which easily leads to the following result. 


LEMMA 3.17. If B © R[x, y, z] is bihomogeneous of degree (i,j), then the moving 
curve B = 0 follows the parametrization ® = (a,b,c) if and only if B € K;. 


Computing the Defining Equations. In the larger ring Re, x, y, z], the map 
Rie; L,Y, z] —~ Rie] 


sending e,z,y,z to e,ae,be,ce has kernel (x — ae, y — be, z — ce). It follows that 
K =ker(6: Rix, y, z] > R(L) C R[e]) is the intersection 


K = Riz, y, 2] (a — ae, y — be, z — ce) C Rie, a, y, 2]. 
Since R = k[s,t], this becomes 
K =Kk{s,t,2,y, 2] N (a — ae, y — be, z — ce) C ke, s,t, x, y, 2], 
which is a problem in elimination theory. Here is an example. 
EXAMPLE 3.18. For the parametrization with n = 4 given by 
& = (a,b,c) = (¢*, st’, s* — st), 
a computation in MACAULAY2 gives the minimal generators of K: 


at SR 


QQle,s,t,x,y,z, Monomial0rder => Eliminate 1] 


i2: J 


ideal (x - t*4*e, y - s°2*t°2*e, z - (s74-s*t73) *e) 
i3 : mingens ideal selectInSubring(1, generators gb J) 


03 = |txy-sy2t+sxz sx2-ty2+txz stx-s2y+t2z s2x-t2y x3y-y4+2xy2z-x2z2 | 
eee OE 
moving conics moving lines implicit equation 

Recall that (7,7) means degree 7 in s,t and degree j in 2,y,z. Then K has five 
minimal generators with degrees as follows: 

e Two moving conics of degree (1, 2). 

e Two moving lines of degree (2,1). Thus p = 2. 

e The implicit equation is a quartic. Degree (0,4) means that it is the moving 

curve that doesn’t move! 


We will soon see that these degrees are no accident when n = 4 and «= 2. The 
next section will present a slicker way of computing K in MACAULAY2. <— 
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Two Theorems about Minimal Generators. The first theorem is from [313], 
where Sederberg, Goldman and Du generalize Example [3.18] as follows. 


THEOREM 3.19. If a curve parametrization ® : P! aie P? has n = 4 and 
deg(®) = 1, then: 
(1) w= 2 tf and only if all singular points of C have multiplicity 2. 
(2) When pw = 2, the defining equations ideal K has five minimal generators: 

— Generators p and q of degree (2,1) (the p-basis). 

— Two moving conics C; and C2 of degree (1,2). 

— The implicit equation F = 0 of degree (0,4). 


A rigorous proof using local duality can be found in [96]. This paper also 
reveals the following “internal structure” of the minimal generators of K. The first 
step is to write the pi-basis in terms of s?, st, t?: 


p= s*lo(a, Y, z) Tr stly (a, Y, z) ar tPlo(a, Y, z) 


q= 700 (@, Y; Zz) TT stl) (a, Y; Zz) Tv le as Y; zi 
where fo,...,€4 are linear in x, y, z. Then compute polynomials C1, C2, F as follows: 
e Express p,q in terms of s?,t to obtain 
p= slo + t(sy + tls) 
q= sl, +t(sl, + tl) 
Note that C has degree (1, 2). 
e Express p,q in terms of s,t? to obtain 
p= 8(slo + te;) + t?l2) 
q = s(sly + tl.) +t7£4) 
Similarly, Cz has degree (1, 2). 
e Express C1, C2 in terms of s,t to obtain 
Cl = sQo(a, Y, 2) + tQi(z, Y, z) 
C2 = 8Qo(2, y, 2) + tQ1 (x, y, 2) 
Note that F' has degree (0, 4). 


Theorem 2.12 of shows that in Theorem 8.19] minimal generators of K are 
given by the p-basis p,q together with C1, C2, F as defined above. The last three 
polynomials are examples of Sylvester forms. Thus one can construct the defining 
relation ideal K, including the implicit equation, from the p-basis via iterated 
Sylvester forms. 

The second theorem shows that the case 4. = 1 follows a similar pattern. 


and define C, = det i ea fe 


ly sl + tls 


and define Co = det ie + thy Z| ; 


st, +tl, &, 


and define F' = det be | : 


QM 


THEOREM 3.20. If a curve parametrization ® : P+ She, P? has n > 3 and 
deg(®) = 1, then: 

(1) w=1 éf and only if C has a unique singular point of multiplicity n — 1. 

(2) When pw = 1, the defining equations ideal K has n+ 1 minimal generators 
consisting of the y-basis elements p,q of degrees (1,1) and (n — 1,1), together 
with polynomials q; of degree (n— i,t) fori = 2,...,n. Furthermore: 

— F = qy has degree (0,n) and gives the implicit equation of C. 
— If we set q = q, then for i = 2,...,n, q can be obtained as the Sylvester 
form of p and q_-1. 
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This result was conjectured in 2007 by Hong, Simis and Vasconcelos and 
proved by Cox, Hoffman and Wang a year later. 

Many people have worked in the area, including Busé, Cortadellas, D’Andrea, 
Jia, Kustin, Madsen, Polini, Ulrich and others. See the paper for recent results 
and references to the literature. There are also lovely relations to singularities and 
interesting generalizations to parametrizations P!' + P%. The paper has 
numerous useful citations, including the pioneering papers of Maria-Grazia Ascenzi 
from the late 1980s. 


Surface Parametrizations. It should be no surprise that the algebra involved in 
surface parametrizations is richer and more complicated than for curves. We begin 
our discussion of surfaces with an example that illustrates some of the new features 
to consider. 


EXAMPLE 3.21. Let A? denote 2-dimensional affine space, typically R? or C?, 
and similarly for A°. Then consider the parametrization 


Ss t st 
s2 +42’ 52 + ¢2’ 52 4 12 


(s,t) EA? (2,9, 2) = ( ea? 

For curves, the link to algebra worked best in the projective context. But what 
does “projective” mean in the surface case? Here are two ways to make the above 
map projective: 


D2 


(s,t, u) € P? +> (su, tu, st,s? +27) ¢ P 


3.28 
oe (s,u;t,v) € P! x P! -> (suv”,u7tu, sutv, s?v?+u7t?) € P?. 


In the language of SectionB.1] the first is a triangular parametrization of degree 2, 
while the second is a tensor product parametrization of degree (2, 2). 

Notice also the presence of basepoints in (8.28), which are parameter values 
where the parametization is undefined as a point in P?. One can compute that 


P? --» P? has basepoint (0,0, 1). 
p! x P! --+ P® has basepoints (0, 1;0,1) and (1,0;1,0). 


Even for a parametrization R? + R® (typical in geometric modeling), we will see 
that basepoints at oo or over C cannot be ignored. << 


In the affine setting, a surface parametrization is 


ae _ (a(s,t) b(s,t) e(s,t) 
®:A*--+ A®,  O(s,t) = Cant d(s,t)’ as.) 


where a,b,c,d € k[s,t] are relatively prime and k = Q,R or C. We will often write 
this as 


®:A?-->P?, O(s,t) = (a(s,t), 0(s, t), c(s, t), d(s, t)), 


denoted ® = (a,b,c,d). We always assume that the Zariski closure of the image of 
® is asurface S C P?. For simplicity, we focus on two ways to make this projective: 
e Triangular: ® : P? --+ P? is given by a,b,c,d € k[s,t, u], homogeneous of the 
same degree n. 
e Tensor product: ® : P! x P! --+ P® is given by a,b,c,d € k[s,u;t, v], biho- 
mogeneous of the same degree (m, 7). 


114 3. GEOMETRIC MODELING 


In each case, we get an ideal J and a basepoint locus B: 

e Triangular: I = (a,b,c, d) C k[s,t,u] with B = V(J) C P?. 

e Tensor product: I = (a,b,c, d) C k[s,u;t,v] with B= V(I) CP! x P?. 
For curves, the assumption gcd(a, b,c) = 1 implies that there are no basepoints, 
while for surfaces, gcd(a, b, c,d) = 1 guarantees that the basepoint locus B is finite. 


The Degree Formula. Similar to the curve case, the degree formula for a surface 
parametrization ® = (a,b, c,d) involves three types of degree: 
e a,b,c,d have degree n (triangular) or degree (m,n) (tensor product). 
e deg(S) is the degree of the irreducible homogeneous F' € k[x,y, z,w] with 
S=V(F). 
e deg(®) is the number of points in the domain that map to a generic point of 
S when working over C. 
In addition, each basepoint p € B = V(J) contributes to the degree formula via its 
Hilbert-Samuel multiplicity e,(1). We refer the reader to Chapter 12] for the 
definition of e,(Z). 
We are now ready to state the degree formula for a surface parametrization. 


THEOREM 3.22. If a,b,c,d € k[s,t,u] are relatively prime and homogeneous of 
degree n, then the parametrization ® : P? --+ S C P® satisfies 


deg(®) deg(S) = n? — x e,(I). 
pEB 
On the other hand, if a,b,c,d € k|s,u;t, v] are relatively prime and bihomogeneous 
of degree (m,n), then ®: P! x P! --+ S CP® satisfies 
deg(®)deg(S) = 2mn— 5° ep(L). 
pEeB 


A proof for triangular parametrizations can be found in [94]. This proof is 
easily adapted to tensor product case. 
EXAMPLE 3.23. For the affine parametrization 


S t st 
_ = 
524 t2 y 824+? 824 ¢ 


i — 


from ExampleB.21] elimination theory gives S C A? defined by x7z+y?z—2ry = 0. 
Hence deg(.S') = 3. Let’s compute this using the degree formula: 
e ©: P? --+ 9 C P* defined by ®(s, t, u) = (su, tu, st, s? +t") has degree 2 and 
deg(®) = 1. The basepoint p = (0,0,1) has e,(Z) = 1, so that 


deg(S) = 2? -1=3. 


e @: P' xP! --+ § C P? defined by G(s, u; t, v) = (suv”, u7tv, sutv, s2u?+u7t?) 
has degree (2,2) and deg(®) = 1. As for the basepoints, p; = (0,1;0,1) has 
€p, (I) = 1 and pe = (1,0;1,0) has e,, (I) = 4, so that 


deg(S) =2-2-2-1-4=3. 


These two computations of deg(S’) illustrate how the basepoints of ® can differ in 
number and complexity depending on how ® is represented. <— 
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EXAMPLE 3.24. Consider a bicubic parametrization ® = (a,b,c, d), such as the 
one pictured in (8.7). When a, b,c, d are generic of degree (3,3), one can show that 
B= V(a,b,c,d) =@ and deg(®) = 1, so the parametrized surface S has degree 

deg(S') =2-3-3= 18. 
Recall from Section[L.2]that geometric modelers first computed this number via ray 
tracing. It is nice to see how algebraic geometry gives the same answer. 

It follows that the implicit equation F = 0 of S$ will typically have (7) = 1330 
terms. This is small by modern standards but not something one wants to write 
out explicitly. << 


Syzygies and Moving Planes. Syzygies, moving lines, and p-bases are useful 
for curves. This is true for surfaces, with some complications. Let ® : A? --+ P® be 
given by relatively prime polynomials a,b, c,d € k[s,t]. Then a moving plane in P? 
Az + By+Cz+Dw=0, A,B,C,D€k{s,t], 
follows ® if and only if Aa + Bb+Cc+ Dd = 0. Thus moving planes correspond 
to elements of the syzygy module 
Syz(a,b, c,d) C k[s, t]*. 
A first complication is that when we work homogeneously, the syzygy module in 
k[s,t, u]* or k[s,u;t,v]* is rarely free (see, for example, Theorem 5.1]). But it 
is always free in the affine setting, as proved in [76]. 
THEOREM 3.25. For a,b,c,d € k{s,t] with gcd(a,b,c,d) = 1, the syzygy module 
Syz(a, b,c, d) is a free k[s, t]-module of rank 3. 
A basis {p,q,r} of Syz(a, b, c,d) C k[s,t]* is called an affine p-basis, which we 
write either as the matrix 
vi) A B CC OD 
(3.29) q|-|4 BY Cc’ Dp 
r A” B" (Og Dt 
or as linear forms 
p=Axt+By4+Cz24+dDw 
(3.30) q=A'x+Byt+Cz2z+Dw 
r= A" + BUy+C"2z4+ Dw 
with coefficients A, B,...,C”, D” € k[s,t]. Here is an example. 


EXAMPLE 3.26. ®(s,t) = (2st, 2t,2s,s? +t? +1) has affine p-basis 
p=@-sy 
(3.31) q= ( +1)y + stz — 2tw 
r= sty+(s*+1)z—2sw. 
As a map P? > P® (no basepoints in this case), the parametrization is 
G(s, t, u) = (2st, 2tu, 2su, s? + t? + u?). 


However, the homogeneous syzygy module in k[s,t, u]* is not free—one can check 
using MACAULAY2 that it has five minimal generators! << 


In the affine situation, we also have the Hilbert-Burch Theorem. 
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THEOREM 3.27. The 3 x 3 minors of (8.29) are a,b,c,d up to a constant. 


The Resultant of a y-Basis. For curves, TheoremB.16]tells us that the resultant 
of a p-basis is the implicit equation when deg(®) = 1. We now explore the extent 
to which this remains true in the surface case. Given an affine p-basis {p,q,r} C 
k[s,t]*, there are three related issues to consider. 
ISSUE 1: Resultants require homogeneous inputs. Hence: 
e For a triangular parametrization 6 : P? + P*, make the polynomials p, q,r 
in (8.30) homogeneous with respect to s,t,u. The resultant is 
Res(p, qd, r) = Ress,t,u (p(s, t, u), a(s, t, u), r(s, t, u)), 
where Res, ;,, is the classical resultant from (1.11) in Section 
e For a tensor product parametrization ® : P! x P! > P°, make p,q,r biho- 
mogeneous with respect to s,u and t,v. The resultant is 
Res(p, g, 7) = ReSs,u:t,v(p(s, ust, v), o(s, ust, v), r(s, ust, v)), 
where Ress,u:t,. is the sparse or Dixon resultant discussed in Section [2] 
Res(p, g,1) is homogeneous in x,y, z,w and vanishes on S$ as we will soon see. 
ISSUE 2: Basepoints have a role to play. By Hilbert-Burch, a, b,c, d € k[s,t] are the 
maximal minors of the 3 x 4 matrix (8.29) with rows p,q,r. Hence 
(so, to) € A® is a basepoint + > (29) has rank < 3 at (so, to) 


3.32 
een) <=> (8.29) has rank = 0,1,2 at (so, to). 


Basepoints can lead to extraneous factors in Res(p,q,1) as we will see below. 


IssuE 3: As the name implies, an affine p-basis {p, q,7r} gives no information about 
how the corresponding moving planes behave at oo (meaning P?\ A? or P! x P!\A?). 
There are two things that can happen at oo: 

e Basepoints can occur at oo, sometimes leading to extraneous factors. 

e Failure to generate the syzygy module at oo can lead to extraneous factors. 


EXAMPLE 3.28. Written in homogeneous coordinates, Example [3.26] gives the 
parametrization 


G(s, t,u) = (2st, 2tu, 2su, s? + t? + u?). 
As previously noted, there are no basepoints. Yet the p-basis has resultant 
Res(p,q,7) = y' F(x, y, 2,w), 
where F = 0 is the implicit equation of the surface S$ C P?. The extraneous factor 
y’ arises from the failure of (8.31) to be a p-basis at oo. < 


Here is an example where the resultant is useless. 


EXAMPLE 3.29. The affine parametrization 


s4+tt s*t 8? st? ss 
= — i—4 a = 
gp? Ys Bt 
has Res(p, q,r) = 0. We will see in Example B.31] below that this happens because 
(s,t) = (0,0) is a really bad basepoint. < 
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A First Look at Res(p,q,7r). In spite of the problems just noted, Res(p, q,1r) 
has some nice properties. To understand why, observe that Res(p, q,7) vanishes at 
any point (2, y,z,w) where the equations 


p=Azr+By4+C z+Dw=0 
(3.33) q= A r+ Byt+Cz+Dw=0 
r= A"rx+ B/y+C"z4+ D"w=0 


have a solution s,t. Since (x,y, z,w) = ®(s,t) is a solution, we have the following. 
THEOREM 3.30. Res(p,q,7r) vanishes on the parametrized surface S C P3. 
Furthermore, if (s,¢) € A? is not at a basepoint, then the matrix 
Pp 
q|=|A’ B’ Cc’ D 
r Al B" Git Dp" 


has rank 3 at (s,t) since a,b,c,d are its 3 x 3 minors by Hilbert-Burch. This 
shows that away from the basepoints, the moving planes have a unique point of 
intersection: 


This point must be on S since the planes follow the parametrization. Thus we 
recover the surface analog of the picture for curves shown in (3.22). 

However, Examples and [3.29] show that Res(p,q,7) may have extraneous 
factors or vanish identically. We now give an informal discussion of how basepoints 
affect the resultant. 


Behavior at Basepoints. At a basepoint, the parameter values “blow up” to an 
exceptional curve on the surface S C P’. Exceptional curves come in three flavors: 
e A line. 
e A plane curve. 
e A space curve. 


On the other hand, at a basepoint, we noted earlier that the matrix 


D A B CC OD 
q — A’ B’ C’ D: 
r A” B" (Gis Dp" 

has rank < 3, so the rank is 2, 1 or 0. A wonderful fact is that the three flavors of 


exceptional curves correspond to rank = 2,1,0. Let’s explore each of these, starting 
with the nicest. 
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At a Rank 2 Basepoint. Here, the moving planes intersect in a line: 


exceptional curve 


The line lies on the surface and is the exceptional curve. There is no extraneous 
factor to worry about. 
We also have a nice connection to commutative algebra: 
p = (s,t) is a rank 2 basepoint <= the localized ideal (a,b, c,d) C Oa2 » 
has two minimal generators. 


In other words, a basepoint has rank 2 if and only if the the ideal (a,b,c,d) is a 
local complete intersection (LCI) at the basepoint. 


At a Rank 1 Basepoint. Here, the three moving planes coincide: 


exceptional curve 


The exceptional curve is a curve lying in this plane. The resultant acquires an 
extraneous factor that is the equation of the plane raised to some exponent. 

To explain the exponent, we need more commutative algebra. At a basepoint 
p = (s,t) of the ideal I = (a,b, c,d), we have two numerical invariants: 


e The Hilbert-Samuel multiplicity e,(I) that appears in the degree formula. 

e The degree d,(I) = dim O,2_,/(a, b, c,d). 
These invariants satisfy 

ep(1) 2 dp (J), 

with equality if and only if p is LCI. We also know that 

p = (s,t) is a rank 1 basepoint <= the localized ideal (a,b, c,d) C Oa?» 

has three minimal generators. 

This is called a local almost complete intersection. At a rank 1 basepoint p, the 
extraneous factor of Res(p,q,1) is given by 


ep (I)—dp(I 
Lop ao(D), 
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where L, = 0 is the plane defined by the p-basis at the basepoint. The exponent 
e,(I) — d,(I) > 0 is a numerical measure of how close a local almost complete 
intersection is to being LCI. 


At a Rank O Basepoint. Here, the moving “planes” are the ambient space since 
every coefficient vanishes. In this case, the exceptional curve is a space curve. At 
such a basepoint, the equations (3.33) have a solution for any (a,y,2z,w), so that 
the resultant Res(p, q,7) vanishes identically. Finally, 
p =(s,t) isa rank 0 basepoint ++ the localized ideal (a,b, c,d) C Og2,y 
has four minimal generators. 
Rank 0 basepoints have no special name beyond the fact that they are really bad. 
EXAMPLE 3.31. The parametrization 
s° +t4 _ st 8? _.sr 8 
p UO pw eB | 
from Example has a basepoint at (s,t) = (0,0). Here, the ideal is 
(Sat Stel Pee stat £\ ait, 


Ha 


which requires four minimal generators, even after localization. This shows that 
(0,0) is a rank 0 basepoint and explains why Res(p, q,1) = 0. < 


What We Have and What We Want. Summarizing the above discussion, we 
see that if all basepoints are at worst local almost complete intersections, then the 
resultant of an affine p-basis can be written 


Res(p,q,r) = Frieat®) 5 II Lep(t)—de(2) x extraneous factor at oo, 
& rane known by and 
what we want 

where the product is over all rank 1 basepoints and F' = 0 is the implicit equation 
of the surface S. The theory behind this formula is explained by Busé, Chardin 
and Jouanolou [58], who also analyze the extraneous factors at oo in the triangular 
case. Extraneous factors at oo in the tensor product case are described by Shen 
and Goldman [819]. Their paper also includes some lovely examples. 

The drawback of this approach is that yi-bases and resultants are not easy to 
compute, and there are often extraneous factors at co we don’t want. As indicated 
above, what we really want is the product 


(3.34) Pres(®) y iil Eee) del) 
rank 1 
We next give an alternate approach to this formula based on matrix representations 
(also called implicitization matrices) built from moving planes of fixed degree. 
Matrix Representations. We begin with a definition. 
DEFINITION 3.32. A homogeneous polynomial H € k/x,y,z,w] is represented 
by a matriz M of homogeneous polynomials in k[x, y, z, w] when: 


(1) The rank of the matrix M drops precisely when evaluated at the points of the 
surface in P? (over k) defined by H = 0. 
(2) H is the gcd of the maximal minors of M. 
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In [58], Busé, Chardin and Jouanolou explain how to construct a matrix that 
represents (3.34) in the triangular case. Let a,b,c,d € k[s,t,u] be relatively prime 
and homogeneous degree of degree n. For an integer v, let S),..., Sy be a basis of 
Syz(a,b, c,d), C k{s,t, u]4. This gives N moving planes 


£,=S;-(2,y,2,w), t=1,...,N. 

As we have done several times before, express £; as a polynomial in s,t,u whose 
coefficients are linear forms in x, y, z,w. These linear forms give a matrix M, with 
the property that 

: Li 

My | 5%¢8y7| = |: 

where the column vector on the left consists of all monomials of degree vy in s,t, u. 
The following result from shows that M, is the matrix we want. 


THEOREM 3.33. Assume that k algebraically closed and that the basepoints of 
® = (a,b,c,d) are at worst local almost complete intersections. If v > 2(n — 1), 
then the above matrix M, represents the polynomial Fs) % Mie 4 pea) 

This result dates from 2009, though the idea of dropping rank goes back to 
van der Waerden’s 1926 proof of the Fundamental Theorem of Elimination Theory 
351|. Here are two further comments about the paper [58]: 


e The proof of Theorem [3.33] in uses the approximation complexes of Her- 
zog, Simis and Vasconcelos [202], which were first applied to implicitization 
in 2003 by Busé and Jouanolou [64]. See Section [3.3] for more details. 

e The result in generalizes to parametrizations P™ --» P™*!. The curve 
case is m = 1, where the matrix M, works for v >n—1. Ifv=n-—1, then 
M, = M,,_1 is the square matrix that computes Res(p, g) in Theorem B.15] 


Applications of Matrix Representations. In 2009, Luu Ba, Busé and Mourrain 
used matrix representations and numerical linear algebra to study intersec- 
tions of a curve and a surface. This was extended to the intersection of two surfaces 
by Busé and Luu Ba in [56]. 

There is also a version for P! x P! or more generally for ® : Xp --+ P® where 
Xp is the toric surface of the Newton polygon P of a,b,c,d. This brings us back 
to a result mentioned in the Preview of the 21st Century at the end of Section [1.2] 
In the notation of this section, the goal is to eliminate s,t from 


_ a(s,t) _ W(s,t) —_ c(s, t) 
~ ad(s,t)’ %~ d(s,t)’ ~~ d(s,t)’ 
Pick a lattice polygon P C Z2, that contains the Newton polytopes of a,b,c,d. If 


Xp is the toric surface of P, then a,b,c,d give a parametrization 


®: Xp --> P*. 


(3.35) 


To find the implicit equation F' = 0 of the image, we proceed as follows. The set of 
all syzygies of Aa + Bb + Cc + Dd = 0 where the Newton polytopes of A, B,C, D 
are contained in 2P is a finite-dimensional vector space that can be computed by 
linear algebra. A basis of such syzygies gives polynomials 


LY = AM(s, t)a + BO(s,t)y + CO(s,t)z + D(s,t)w, 1=1,...,N, 
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which we rewrite in the form 


l inj 
= ST Oneyeue'h, 121.0, 
(4,j)€2PNZ? 


for linear forms D5 @52 0). When we arrange the ae into a matrix M of 


linear forms, we have the following result of Botbol and Dickenstein [41]. 


THEOREM 3.34. If all basepoints in Xp are LCI, then the matrix M represents 
Fes ®) | where the implicit equation F = 0 of the surface S parametrized by (8.35). 


This is Theorem [1.18] from Section [2] though in the earlier version, we had 
to say “in nice cases” rather than “LCI basepoints” and say “drops rank precisely 
when evaluated at points on the surface” rather than “represents F4°s(®).” 


Final Comments. The algebra of geometric modeling has a very different flavor 
from the geometry discussed in Section B.1] We met some significant new players, 
including basepoints, multiplicities, syzygies, and Rees algebras, along with links 
to earlier topics such as elimination theory, resultants, and toric varieties. 

As you will learn in the next section, the algebra keeps getting better. 


3.3. Rees Algebras, Syzygies, and Computational Geometry 
by Hal Schenck 


Our goal in this section is to describe in more depth the ideas from earlier in 
the chapter and introduce the concepts needed to unleash their full power: 
Rees and Symmetric Algebras. 
Local Cohomology and the Jacobian Dual. 
Fitting ideals and the Annihilator of a Module. 
Free resolutions and MacRae Invariant. 
The Approximation Complex. 
Multigraded Implicitization. 
Extensions and Future Directions. 


The setup is that the ideal 
T=(fiyssiysfa) CR= kleine: tl 


is generated by homogeneous elements of degree n, which define a map of rings 


(3.36) S=kly,-..,yal] 9 R, yi fi 
This in turn gives rise to a corresponding map of varieties 
pm-\ B®, pel 


where the base locus B = V(I) consists of the points where ® is undefined. Our 
focus will be on the case where d = m+1 and ® generically finite onto its image, so 
the image of ® is a hypersurface H = V(F’). We begin with a motivating example. 


EXAMPLE 3.35. Let 1 = 21 +22 +23 and 


[= (212203, 21 Ll, v1 x3l, LQu3l) = R= klay, 2, U3]. 
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The four lines V(#1), V(x2), V(x3), V(1) depicted here 


determine the base locus 8, which consists of the six intersection points 
B = {(0,1,0), (0,0, 1), (0,1, —1), (1,0, —1), (1, —1, 0), (1,0, 0)}. 


The ideal J has a presentation given by 


l 0 0 
—2X3 x3 0 
0 = 9 v2 
SL [m1 2203 TX2l 212031 x2x3l] 


0 R(-4)° > R(-3)* 


The leftmost matrix consists of the syzygies on the ideal J: polynomial relations on 
the 1 x 4 matrix of polynomials generating I. If we denote the basis vectors for R* 
as {y1,... ya} then we may rewrite the syzygies as 


> 0. 


ly — ©3Y2, ©3Y2 — L2Y3, T2y3 — T1Ya- 


Now we turn this inside out, and write the three expressions above (recall that 
1 =a, +22+ 23) as column vectors with respect to the basis {71,72,73} of Ri: 
YI 0 Ya 
B=| wh —¥s Ys 
Yi-Y2 Ye 0 
Let 
F = det(B) = yoysya — yiysys — yiyoya — Yiy2ys- 

An easy computation shows that F(f1, fa, fs, fa) = 0, so that the determinant of 
B is the implicit equation for the hypersurface H. 

It is instructive to examine the geometry of this example more closely. It is a 
classical fact §V.3] that the blow up X of P? at six general points may be 
embedded as a smooth cubic surface in P?, using the divisor 


6 
D=3E)—)_ Ei, 
4=1. 


where Eo is the proper transform of a line and EF; is the exceptional curve over a 
blown up point. The intersection pairing on X satisfies 
Ha=1 B2=-Litt40 BE, =0 it xj. 


In our example, the points are not general: for example, the line L = V(a1) contains 
the 3 basepoints (say p1, p2,p3), and we compute 


6 
D-L = (3E9— _ Ej) - (Ep — Ey — Ey — Ex) = 3-3 =0. 
4=1 
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This computation shows that any of the four lines which contain three points of 
the base locus is contracted to a point; in particular we expect V(F') to have four 
singular points. The Jacobian ideal of F is generated by 


Y2y3 + Yoya + Y3Y4 
Y1y3 + Y1Y4 — Y3Y4 
yiy2 + Y1Ya — Y2Ya 
Y1y2 + Y1Y3 — Y24¥3 
which has zeroes at exactly the four coordinate points of P?, confirming our expec- 
tation that H = V(F) has four singular points. << 


The remainder of this section will be devoted to understanding the tools and 
techniques that are used to produce the implicit equation F’, in particular to explain 
the mysterious switch between the matrix of syzygies of J, and the matrix B. 


Rees and Symmetric Algebras. In the previous section, we defined the Rees 
algebra R() of an ideal I = (f1,..., fa) C R; it is the image of the map 
Rly... ya] > Rie] via yi five. 


If R is a domain, then since R(I) is isomorphic to a subring of Rie], then R(Z) will 
also be a domain. If we grade by the variables y; and let K = ker(3), then: 


Ky = Sayre: dia fi =0 <=> (aj) € Syz(L) 
Kg = > Aig Yi e? . Ss aig Sif; = 0 (ai) € Syz(I?) 
K3 = —= (aise) € Syz(I?) 


It follows from the definition that R(J) ~ Rlyi,...,ya]/. A close cousin of R(I) 
is the symmetric algebra Symp(L), defined as R[y1,..., ya]/(A1). It turns out that 
computing R(L) is a difficult problem, discussed at length at the end of the chapter. 
The symmetric algebra Sym p(J) is simpler; in the special case where kK = (ker({)1), 
the ideal J is said to be of linear type. 


EXAMPLE 3.36. Example [3.18] computes the ideal defining the Rees algebra 
R(1) of I = (t*, s?t?, s4 — st?). The MACAULAY2 package REESALGEBRA by David 
Eisenbud allows us to compute R(J) as follows (some output suppressed): 


i1 : R=ZZ/31991[s,t]; 
i2 : I=ideal(t°4,s72*t*2,s°4-s*t7*3) ; 
i3 : syz(gens I) 
03 = {4} | s2 -st | 
{4} | -t2 s2 | 
{4} | 0 -t2 | 


3 2 
03 : Matrix R <--- R 


i4 : RlI=reesIdeal I; 


04 : Ideal of RIw,w,w id 
0) 1 2 
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i5 : transpose gens RI 


o5 = {-1, -6} | w_Ost-w_1s2+w_2t2 | 
{-1, -6} | w_Os2-w_1t2 | 
{-2, -9} | w_Ow_1t-w_1°2s+w_Ow_2s | 
{-2, -9} | w_O°2s-w_1°2t+w_Ow_2t | 
{-4, -16} | w_0°3w_1-w_1°4+2w_Ow_1°2w_2-w_0°2w_272 | 


i6 : isLinearType RI 


o6 = false 


The two syzygies on I appear as the first two generators for the ideal defining R(J), 
and we see that Symp(J) and R(L) differ. << 


It is a useful exercise (or see Exercise A2.6]) to show that if R is a domain, 
then the kernel of the surjection from Symp(J) to R(Z) consists of those elements 
of Symp(Z) which are annihilated by R: the kernel of Symp(I) > R(J) is the 
R-torsion of the symmetric algebra. 


Local Cohomology and the Jacobian Dual. In general the linear type con- 
dition is rare: Symp(J) typically has R-torsion. Busé and Jouanolou proved 
that if the base locus is a zero-dimensional local complete intersection, then the 
R-torsion is rather mild, which can be phrased in terms of local cohomology. 


DEFINITION 3.37. For an ideal J and R-module M, the zeroth local cohomology 


H?(M)={me M|I?.m=0 for some j € N}. 


Letting the role of I be played by the ideal m = (21,...,2n), there is a standard 
four term exact sequence in local cohomology (Theorem A4.1 of ): 


0 —> H2(M) — M — @ H°(M(i)) — H}(M) — 0. 
i€Z 
If we use Symp(I) for M in the above sequence, then if J is linear type outside 
m, this means R(/) and Symp(J) define the same sheaf outside V(m), and R(Z) is 


the third term in the sequence above. The surjection from Symp(JI) to R(L) forces 
H} (Symp(J)) to vanish, and so we obtain the exact sequence 


(3.37) 0 — H? (Symp(I)) — Symz(I) R(T) — 0. 
Recall that the annihilator Ann(M) of an R-module M is the ideal 
{ré€R|r-m=0 for all me M}. 


A pair of theorems due to Busé and Jouanolou are the key to using syzygies 
to study implicitization. Regarding R(Z) as a graded R-module, here is the first. 


THEOREM 3.38. Ford: S > R from (3.36), ker(¢) = Anng(R(L)o).- 
PROOF. First, note that 
ker($) = KNS, 
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where K = ker(G: R@S —> R(L) C Rlie]). This follows since if F' € S' satisfies 
F(fie,..., fae) =0 € Rel, 

then specializing to e = 1 shows AS C ker(¢). On the other hand, since the f; 

are homogeneous, this means that ker(¢) is also homogeneous, so F'(f,e,..., fae) = 

e™. F(f,,..., fa) = 0, which implies ker(¢) C KNS. Since R(I) ~ R@S/K, FES 

annihilates the degree zero (in R) component of R(I) exactly if 
F-1¢€K, 

which holds exactly when F € KMS = ker(@). 


Caveat: if we do not assume that R is a polynomial ring, then an additional hy- 
pothesis, that H®(R) = 0, is necessary for the theorem to hold. Similar arguments 
yield the following. 


THEOREM 3.39. If H}(Symp(J)), =0 for all b >a, then 
Anng(Symp(I),) C ker(¢) for all b> a, 
and equality holds if I is of linear type outside m. 
If I is not of linear type, then the next best case is when the defining ideal of 


R(I) has the expected form. To explain this terminology, we need the notion of the 
Jacobian dual. 


DEFINITION 3.40. Let N denote the matrix of first syzygies on the ideal J = 
(fi,-.-> fa) C R=kla1,..., 2m], and write 
[tis «stig “N= [iyo B, 


where the entries of B are linear in the y; variables. The matrix B is the Jacobian 
dual of N. If the defining ideal of R(I) is generated by [y1,..., ya]: N and the dx d 
minors of B, then R(J) is said to be of expected form. 


The mystery matrix B that appeared in Example [3.35] is exactly the Jacobian 
dual. In Theorem 3.39] the hypothesis that J is linear type outside m means that 
in high enough degree b in the R variables, 


Hp (Symp (Z))o =0 
and hence in high enough degree, Sym,(J) and R() agree. Theorem B.39] provides 
a path to connect implicitization problems to Fitting ideals and the MacRae invari- 


ant, and as we shall see, gives a concrete means to efficiently compute the implicit 
equation F. Example B.35]is of expected form; Example B.52] below is not. 


Fitting Ideals and the Annihilator of a Module. A standard construction 
in algebra is that of the Fitting ideal of a module. For us, the context is that of a 
polynomial ring R and finitely generated R-module M. Then 


DEFINITION 3.41. Given a presentation 


RP”, RI» M —+0. 


for M, Fitt;(M/) is the ideal of g—% by g—i minors of w; in particular Fitto(/) is 
the ideal of g x q minors of w. 


It is a good exercise to show that the Fitting ideals of MW are independent of 
choice of w; they capture information about the annihilator of the module. 
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THEOREM 3.42. Fitto(M) C Ann(/). 


ProoF. If p < q, then Fittg(M/) = 0 since w has no q x q minors. So assume 
p> q. Choose aq xq submatrix A of w such that det(A) 4 0, yielding a commuting 
diagram 


R! M' 0 
Jb 
RP > RI >M > 0. 
Since M is a quotient of M’, relations on M’ are also relations on M and 
Ann(M’) C Ann(M). 


We need to show that det(A) kills M’. If a € M’ = coker(A) and b € R? maps to 
a, then 


det(A)-a=0 <= det(A)-b= Az for some x € R%, 
and such an x can be found by Cramer’s rule. 


By Proposition 20.7 of [139], if 7 can be generated by r elements, then we 
have that Ann(M)" C Fitto(/). This implies that the radical ideals satisfy 


/Fitto(M) = /Ann(M). 


Theorem [3.38] and Theorem [3.39] show that finding the implicit equation is a prob- 
lem of computing annihilators, hence Fitting ideals will play a role. 


Free Resolutions and the MacRae Invariant. 


DEFINITION 3.43. A sequence of modules C; and morphisms d; 


d; . dj- 
Croce > Ci44 SC) Be. Saks 
is a complex if im(d;+1) C ker(d;) for every i. It is exact at position ¢ if im(dj+1) = 
ker(d;). The i“ homology module is defined as H;(C) = ker(d;)/im(di+1), hence 
H;,(C) = 0 if and only if C is exact at position 7. 
An important special case occurs when the complex has the form 
dn—1 
> 


Le ea ay SG ey ee ee 
where the modules F; are free of rank r;, and the complex is everywhere exact. In 
this case F is called a free resolution of M; it is minimal if the d; have no invertible 
entries. 
By the Hilbert Syzygy Theorem (Corollary 19.7 of ), a finitely generated 
module M over the polynomial ring R has a finite free resolution, which stops in 
at most m steps, where m is the number of variables of R. 


EXAMPLE 3.44. Consider the ideal I = (x?, y?, 2?) C R= kiz,y,z]. Then the 
free resolution of R/I takes the form 


2? y? 2? 0 
-y? aa 0 2 
x 0 2 22 eye 2 
0 —> R(-6) >» R3(—4) ‘ mo adi ) I ul lp > R/I. 


Notice that the first syzygies are spanned by the obvious relations: for every pair 
(f,g) in I, there is a relation g- f — f-g = 0. Such a syzygy is called a Koszul 
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syzygy, and the free resolution above has a simple construction in terms of exterior 
algebra. 


DEFINITION 3.45. For any set of polynomials {f,,..., fa} C R, the Koszul 
complex is given by 
dnd _ d-1 nd 2nd 42, alpd _ pd 1 
0O—NR=R— SN OR >.» > ("R >A R* = R® —> R, 
with di(e;, \---Ae;,) = ya) es, A+++ A€j, A-++Ae;,). A straightforward 
computation ee that d;_; od; = 0. 


The Koszul complex is exact if and only if the f; are a regular sequence: for all 
i, multiplication by f; defines an injection on R/(fi,..., fi-1). Geometrically, this 
means that each V(f1,..., fi-1) has codimension 7 — 1 or equivalently intersecting 
V(fi,---,fi-1) with V(f;) drops the dimension by exactly one. For this reason, 
a variety whose ideal is defined by a regular sequence is known as a complete 
intersection. 

Now we turn to a special situation: let MZ be a module over a polynomial ring 
R, such that M is torsion: for each m € M, there is some nonzero r € R such that 
r-mz= 0. Let F be a free resolution for M. Since R is a domain we may localize 
F at the zero ideal, yielding an exact sequence of vector spaces, with the d; having 
entries in the field of fractions of R. Since the rank of F;, is r, we may factor d,, 


as | , with rank(a,) = Tn. 


By exactness the image of d,, is the kernel of d,_1, and so we may write d,_1 


as 
Bn—1 * 
* An-1 


with 6,,_, is of rank r,, and a,_, is of rank r,_, —Tr,. Iterate this procedure. Since 
M is torsion if and only if it has rank zero, we also have that 


n 


So(-1)'r; = 0. 


i=0 
Thus in the factorization of d, = [G1 | ai], the rank of ay = 11 —rg+7r3—-+: =1o0. 


DEFINITION 3.46. The MacRae invariant of M is 


det(az) - det( (a3) i-l 
M) = t(cj)—" 
aM) det(az) - det(a4) = ] [det a) 


This definition seems to depend on the choice of the a;. However, just as in 
the case of Fitting ideals, it turns out that the MacRae invariant is well-defined. 
For a proof of this, and the remarkable properties of the MacRae invariant below, 
we refer to the book of Northcott [283]. 


THEOREM 3.47. The MacRae invariant is 


Independent of the choices of a;. 

Multiplicative on short exact sequences. 

An element of R, rather than the field of fractions. 

Generator of the smallest principal ideal containing Fitto(M). 


DO oe 
Ne NaN 


( 
( 
(3 
(4 

If we hope to apply Theorem [3.47] above in conjunction with Theorem [3.38] 
or Theorem [3.39] we will need to obtain a resolution of R(J)o or Symp(I), as an 
S-module. This takes us to our next key tool. 
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REMARK 3.48. If F, is not exact but simply a chain complex, it is possible to 
define the determinant of F., see . We will not need this level of generality. 


The Approximation Complex. The canonical example of a chain complex is 
the Koszul complex which appeared in Example [3.44] The Koszul complex takes 
center stage as we describe the work of Herzog, Simis and Vasconcelos [202], 
on approximation complexes. Roughly speaking, the idea is to build a hybrid 
complex using S = k[yi,..., ya] and the ideal J = (fi,..., fa) C R=k[a1,..., 0m]. 


DEFINITION 3.49. In the setting above, let df be the i‘” Koszul differential 
on {fi,..., fa}, Z the kernel of df, and d? the Koszul differential on {y1,...,ya}- 
The approximation complex Z is 


dy d3 dy 
Z: +++ — > Zs) Op S —> Zi Op S > Zi-1 @x S —--- 


Like the Koszul complex, Z depends only on J and not the choice of generators. 
To see the maps are well defined, note that dfd5 +d5df = 0. Let y € ker(d/) = Z;, 
which implies d5df(y) = 0. Combining this with (dfd° + dS d/)(7) = 0, we see that 
df d° (7) = 0; hence 
d°(7) € Z;_1 @, S. 
Consider the rightmost homology of the approximation complex. Since Z, consists 
exactly of the syzygies on I, and Zp is R, we find 


Ho(Z) = R®, S/(Sy2(I)) ~ Symp(J). 


If H;(Z) = 0 for alli > 1, then the approximation complex will give a free resolution 
for Symp(JZ), and we will be able to apply the results on Fitting ideals and the 
MacRae invariant to obtain information about the implicit equation. 


THEOREM 3.50. The approximation complex Z is acyclic if and only if 
fit ERIC Pages cp Fe) =0 for alli € UN 1}; J > 0, 


In [57], Busé and Chardin use a spectral sequence argument to prove that if 
V(J) is zero dimensional, then the approximation complex is acyclic outside V(m) 
if and only if J is locally an almost complete intersection (ACI), that is, generated 
by at most one more equation than the codimension. In this case, Theorem [B.50] 
yields a free resolution of Symp(J). In the ACI case an extraneous factor appears 
in the MacRae invariant [58]; see Theorem [3.51] below. 

Of course, we do not need the whole free resolution; by Theorem B.39] we need 
the subresolution of Symp(I)a, where H2(Symp(J),) = 0 for all b > a. Busé and 
Jouanolou and Busé and Chardin show that when the base locus is empty, 
taking a > (n — 1)(m — 1) suffices, where n is the degree of the f;, and if the base 
locus B is nonempty, we can actually choose a > (n—1)(m—1)—e, where ¢ is the 
minimal degree hypersurface containing the base locus. 

Here is the precise result proved by Busé, Chardin and Jouanolou [58]. 


THEOREM 3.51. If the base locus B is ACT, then for any a > (n—1)(m—1)—e, 
the MacRae invariant of the approximation complex in degree a equals 


Feee®) xT] rex 
pEeB 


in the notation of (8.34). This simplifies to F*8(®) in the LCI case. 
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These methods are used in the proof of Theorem [3.33] in Section 3-2] 

In Example[3.35) we had n = 3 = m, so if J were basepoint free we would need 
the degree (3 — 1)(3 — 1) = 4 piece of the approximation complex. However, since 
the minimal degree curve through the six base points is a cubic, it suffices to look 
at the degree 4 — 3 = 1 graded piece of Symp(J). This explains what happened in 
Example B.35} the key takeaway is that basepoints make the computation simpler! 

Our next example also has a local complete intersection base locus, but illus- 
trates that typically we need more than two steps of the approximation complex to 
determine the implicit equation. 


EXAMPLE 3.52. The ideal J = (x7, x3) is a local complete intersection, and since 
I has a generator in degree two we need the degree two piece of the approximation 
complex. The map ¢ is determined by the cubics in I: [zi Lite 98 r703| 
whose syzygies are generated by the columns of 


—%2 —23 0 0 
Ly 0 —%3 —x3 
0 0 0 reli 
0 Ly v2 0 


which has degree two component generated by the image of 


1122 a L223 11423 L2X3 xe 0 0 0 
ae Ut ©1x3 0 0 0 1x3 se aes 

0 0 0 0 0 0 0 0 ae 
0 0 0 x? at. «2103 ve @9%3 0 


Multiplying [y1, ye, ys, ya] against M yields nine elements 
[—yreiwe + yort,.-.,—yord + ysary], 


and contracting against the quadrics in {21, 72,73} shows that (Z))2 is generated 
by the columns of 


yw 0 0 yw 0 0 0 0 ¥y 
—Y1 Yep 0 0 Ya 0 0 0 0 
_ | 0 0 yo —y O YA 0 0 0 
M((Z1)2) ~~ 0 —y1 0 (0) 0 0 Ya 0 —Yp2 


0 0. =H 0 =f OO = 0 

0 0 0 0 0 -y O -y O 
Next, we need the term Zp in the approximation complex. There are two ways 
to do this: on the one hand, because I is a local complete intersection, we could 
simply compute the kernel of the matrix above. Or we could consider the kernel of 
the second Koszul differential on J, which is generated by the columns of 


x3 a8 0 0 
0 —-atae 0 ates 
0 x? x a3 0 
—22 0 0) —x3 
Ly 0 —x3 0 
0 0 Stas: ae 


The quadratic component of the image is generated by 


u3(e1 A €2) — @2(€1 A e4) +: 41 (e2 A €4). 


130 3. GEOMETRIC MODELING 


Pushing this forward via the Koszul differential yields 
23(a¥eq — 27 2%2e1) — to(xPeq — e}x3e1) + 21(x7x2e4 — z}aV2€0). 


Multiplying this by {x,,22,x3} to obtain the quadratic component and express- 
ing the result in terms of the ordered basis above shows that Z2 has quadratic 
component spanned by the columns of 


YA 0 0 
0 YA 0 
—y 0 Y4 
~Y¥2 0 0 
Yr yo =O 
0 0 —Y2 
0) Y1 0 
0 0 Y1 
0 0 0 


Now that we have the approximation complex, we compute the MacRae invariant 
to get the implicit equation via Theorem[3.51] The determinant of the topmost 3x3 
block of this is y?, and the determinant of the complementary (hence, rightmost) 
block of the matrix M((Z1)2) is y?- (y3 — y?y3), so the implicit equation is given 
by 

ya: (Ys — Yiys) 

Yi 

which is easily checked to be correct. << 


= ¥3 — YiYs, 


Multigraded Implicitization. A case which has attracted much interest in geo- 
metric modeling occurs when the ring R is multigraded. The hypersurface case 
which is the focus here was studied by Botbol [39], and we now describe this in 
more detail. 

The most familiar example of a multigraded map occurs when the map ¢ is 
given by monomials. In this case the image is a toric variety and it is possible 
to compute the implicit equation (or equations if the image is not a hypersurface) 
using simplicial homology. Let the monomials have exponent vectors 


A = {a1,..-, aa} CZ” 


and let A be the matrix with i” column a;. If Y4 C k® is the affine toric variety 
parametrized by the monomials, then Proposition 1.1.9] tells us that vanishing 
ideal of Y4 in k[y1,..., ya] is the toric ideal 


Ia = (y* —y? | a, B € N? and a — 6 € ker(A)) C S = klyi,..-, yal. 


The ring k[y1,..., ya] is multigraded by Z”, where the degree of a monomial y® is 
Aa € Z". The definition of 14 guarantees that [4 is homogeneous with respect to 
this multigrading. The graded piece in degree m € Z” is denoted (I4)m. 

To any m € Z” we associate a simplicial complex 


Am = {J C {1,...,d} | m— 5a; lies in NA}. 
ieJ 
A result of Hochster shows that 
Ho(Ans k) = dim (14) ma: 
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In particular, the generators of the ideal [4 are given by the zeroth reduced homol- 
ogy of certain simplicial complexes built from the semigroup NA. 


EXAMPLE 3.53. Let A C Z? be generated by 
3 2 1 0 
0 1 2 3)° 


We think of these lattice points as vertices v1,...,v4. Then a computation shows 


ie we{(2),(3),()}. 


we have Ho(Am; k) ~ k. For example A/s, has vertices v;,v2,v3 and edge v1 v3. 
2 
Computing, we see that the ideal [4 has three generators 


ys —yiy3 in degree (3) 
yi¥4 — Y2y3 in degree (3) 

y3—y2ys  indegree (4). 
Hochster’s result is more general, and shows that the higher syzygies of [4 may also 
be computed in similar fashion: first syzygies correspond to elements of Hi (Am, <k), 
and so on. In our example, a computation shows that 

Hi(Am,k) =k 

if and only if 


me { @) ; (5) \ , so there are exactly two first syzygies on I. 


For more details and an in-depth exposition, see ; << 


In the more general setting of a multigraded ideal J which is not generated by 
monomials, the approximation complex machinery works by taking a sufficiently 
high multidegree subcomplex of Z. The multigraded structure means that the local 
cohomology must be computed with respect to a different ideal. 

Recall that in geometric modeling, the most common surfaces are triangular 
surfaces, which correspond geometrically to P?, and tensor product surfaces, which 
correspond geometrically to P! x P+. Algebraically, a tensor product surface comes 
from a bigrading on R = k/s,t,u,v], with s,t of degree (1,0) and u,v of degree 
(0,1). To make multigraded implicitization concrete, we focus on tensor product 
surfaces. 

Let Rm n denote the graded piece of R in bidegree (m,n). A regular map 


Pp! x P! -°; P3 is defined by four polynomials 
{io Dis Po; Ba} CS Rigg 
with no common zeros on P! x P!. The empty locus on P! x P! is defined by the 
bigraded ideal 
m = (s,t)N (u,v) = (su, sv, tu, tv). 
Therefore the map ® is basepoint free map if and only if the ideal 
I= (Po, Pi; P2,P3) 
satisfies V7 = m. 


132 3. GEOMETRIC MODELING 
EXAMPLE 3.54. Consider the ideal 
I = (s?u, s?v, t?u, t?u + stv), 


generated by four elements of bidegree (2,1). The syzygies on I are generated by 
the columns of 


—v —t? 0 0 —tu 
U 0 —st—# 0 0 
0 s? 0 —svu—tv —sv}’ 
0 0 s? tu SU 
which we encode as 
uYy2 — VY1 = 0 
s?y3 — ty, = 0) 
s*y4—(st+t7)yo. = O 
tuys —(su+tv)yz3 = 0 
suys — Svy3 —tvy, = 0. 


If we were in the singly graded case, we would need to consider degree 2, and a basis 
for Z? consists of {s,t,u,v}-wy2 — vyi, and the remaining four relations. With 
respect to the ordered basis {s?, st, t?, su, sv, tu, tv, u?, uv, v7} for Ro the matrix for 
d? : Z2 —>+ Z2 is 


0 0 0 0 yy 0 0 
0 0 0 0 0 —-—Ypo 0 0 
0 0 0 0 -y -y 0 0 
-~ 02 D2 © 08 OG OD we 

=r 0 0 0 0 0 -Yy3s —Yys3 
0 w 0 0 0 0 yw 0 
0 -y 0 0 0 0 =—te —Hi 
0 0 Yy2 0 0 0 0 0 
0 0 -y% y 0 0 0 0 
0 0 0 —-Yy 0 0 0 0 


This matrix represents all the first syzygies of total degree two. The multigraded 
setting gives us extra structure to work with, and in Corollary 14 of [42], Botbol, 
Dickenstein and Dohm give a bound in terms of the multidegree; in the case of 
this example the bound tells us that it suffices to work with the subcomplex of 
the approximation complex of bidegree (1,1). The first syzygies of bidegree (1, 1) 
correspond to the submatrix whose rows are indexed by {su, sv,tu,tv}, which is 
given by 


O -Yy Ys —Y 


The image of ® is the determinantal hypersurface 


H = V(yiysys — y3y3 + 2yryoysya — yt ya) 
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pictured here: 


In Theorems [3.19] and [3-20] in Section B.2] we saw that the structure of the 
syzygy matrix is closely connected to the behavior of the singular locus of the 
implicit curve. This also holds for tensor product surfaces in P? of bidegree (2,1): in 
[308], Schenck, Seceleanu and Validashti show that there are six possible structures 
for the syzygy matrix, and describe the codimension one singular locus for each case. 
In our example above (from [308]) there is a unique syzygy of bidegree (0,1), which 
in turn forces the codimension one singular locus of H to be the union of three lines 
V(y1, 43) U V(y2, 44) UV (41, ya). 

In what follows, the variables yj, y2,y3, ya will be denoted w3, wi, w2, wo for 
consistency with MACAULAY2 output. Since the generators of J are of bidegree 
(2,1), the bidegree (0,1) syzygy occurs in total degree (2,2), and the variables w; 
are of degree (1,0,0). Therefore the generator wy2 — vy; of the Rees ideal is now 
written w,u—ws3u and occurs in degree (1, 2,2), explaining the labelling of the last 
matrix below. 


i1 : R=ZZ/31991[s,t,u,v,Degrees=>{{1,0},{1,0},{0,1},{0,1}}]; 
i2 : I=ideal(s*2*u,s*2*v,t*2*u,t*2*vt+ts*t*v) ; 


i3 : rl=(res coker gens I).dd 


4 5 
1 Ro <p rrrssress R 2 
{2, 1} | -v -t2 0 0) 0) | 
{2, 1} |0 s2 0 -sv-tv -tv | 
{2, 1} |u OO st+t2 0 tu | 
{2, 1} 10 0 -s2 tu -suttu | 
5 2 
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{2, 2} | -t20 | 
{4, i} | v 0) | 
{4, 1} | 0 -u | 
{3, 2} | s-t -t | 
{3, 2} | t  stt | 


i4 : transpose gens reesIdeal I 


04 = {-1, -2, -2} | w_lu-w_3v | 
{-1, -4, -1} | w_2s2-w_3t2 | 
{-1, -3, -2} | w_Otu-w_2sv-w_2tv 
{-1, -3, -2} | w_Osu-w_2sv-w_3tv 
{-1, -4, -1} | w_Os2-w_1st-w_1t2 
{-2, -5, -2} | w_1w_2t+w_Ow_3s-w_Ow_3t-w_1w_3t 
{-2, -5, -2} | w_1w_2s-w_Ow_3st+w_1w_3t 
{-2, -4, -4} | w_072u2-2w_Ow_2uv+w_2*2v2-w_2w_3v2 | 
{-3, -6, -4} | w_1w_2°2v+w_0°2w_3u-2w_Ow_2w_3v-w_1w_2w_3v | 
{-4, -8, -4} | w_1°2w_2°2-2w_Ow_1w_2w_3-w_1°2w_2w_3+w_072w_372| 


Tensor product surfaces are a type of ruled surfaces, on which there is an exten- 
sive literature, ranging from the classical work of Edge and Salmon to 
contemporary work motivated by geometric modeling. We give pointers to relevant 
papers below. <— 


Directions for Future Research. There are a myriad of open questions at the 
interface of geometric modeling and algebraic geometry. A central problem is how 
to leverage results from situations where the free resolution of J is known to obtain 
results on the Rees algebra. For an example along these lines, if J is codimension 
two and Cohen-Macaulay, then J has a Hilbert-Burch resolution: I = (f1,... fa) is 
generated by the (d—1) x (d—1) minors of a d x d—1 matrix, whose columns are 
the syzygies of I. In [276], Morey and Ulrich determine the equations for the Rees 
algebra, when all the syzygies are linear, and gives an answer when exactly 
one syzygy is nonlinear; however the general case remains open. Some additional 
questions: 


e Rees algebras of rational curves and connections to singularities. In addition 
to the works and mentioned earlier, there is a large literature on 
the interplay between syzygies and singularities of rational curves. In [103], 
Cox, Kustin, Polini and Ulrich carry out a comprehensive investigation; see 
also Cortadellas and D’Andrea and [91]. 


e Tensor product surfaces have been extensively studied, but there remain 
many open questions. See [122], [134], [142), 158), for some work on 
low degree cases; more generally [39], [42], study toric implicitization. 


e Matrix representations: rather than finding the actual implicit equation, fo- 
cus on a matrix which drops rank at points of the implicitization. As Botbol, 
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Busé and Chardin write in “matrices have to be seen as implicit represen- 
tations on their own, without relying on the more classical implicit equation”. 
For more work along these lines, see and Theorem [3.34] in Section B.2] 


Codimension three Gorenstein ideals have a structure theorem, and in [248], 
Kustin, Polini and Ulrich determine the equations for the Rees algebra. Is 
there a similar result for Gorenstein deviation two ideals, where there is a 
structure theory due to Huneke and Ulrich and the resolutions are 
known from Kustin [246]? 


The resolution of ideals obtained from the submaximal minors of a matrix of 
variables may be constructed via representation theory and Bott-Borel-Weyl, 
see Weyman for details. What can one say about the Rees algebra of 
such examples? About the simplest case of an Eagon-Northcott resolution? 


While Hochster’s method provides a way to compute the implicit equations 
for the image of a monomial map, computing minimal generators for R(J) 
is nontrivial: see Lin for a beautiful application of Alexander duality 
to the initial ideal of the maximal minors of a generic matrix, and Fouli and 
Lin for additional results on squarefree monomial ideals. 


When R(J) is of quadratic type (determined by Syz(I) and Syz(JI?), 
shows that half the degrees where the torsion is nonzero are understood. 


Methods from tropical geometry: if one can determine the degree of the 
implicit equation of a hypersurface, then the problem of actually describing 
it is linear algebra; in and tropical geometry is used to shed light 
on the problem. 


CHAPTER. 4 
Rigidity Theory 


This fascinating topic involves the study of frameworks built from bars and 
joints in R? and R°. Given such a framework, a key question is whether it is 
rigid, and if not, what motions are possible. Studying this and other questions 
involves a wonderful combination of Euclidean geometry, algebraic geometry, and 
combinatorics. 


4.1. Geometry of Rigidity 


For most of this section, G = (V, ) will be a simple graph with vertex set 
V = [n] = {1,...,n}. An edge joining vertices i 4 7 is denoted 7 € EF. 


DEFINITION 4.1. A bar-and-joint framework in R¢ for G = (V, E) consists of: 
(1) Joints, which are points q; € R@ fori € V. 
(2) Bars, which are line segments qjqj for ij € E. 


We often write a bar-and-joint framework as q = (q;)icev € (R*)”. The map 
V — R? sending i € V to q; € R?® is typically injective. Frameworks where 
injectivity fails are said to be degenerate. 


EXAMPLE 4.2. Here are two bar-and-joint frameworks: 


Rigid but not infinitesimally Caffeine molecule in R® 
rigid framework in R? 


The framework on the left previews of some flavors of rigidity we will encounter. 
The molecule on the right can be modeled by a framework. Chemical properties 
of a molecule can change when folded (important in the study of proteins). This 
illustrates how rigidity (or lack thereof) arises naturally in applications. << 


In a bar-and-joint framework, the bars meet at ball joints that can rotate in all 
directions. The idealized version presented in Definition [4]] represents bars by line 
segments of zero thickness and joints by points also of zero thickness. 

A related type of framework allows bars to meet at rigid bodies. 
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DEFINITION 4.3. A body-and-bar framework in R¢ for G = (V, E) consists of: 
(1) Bodies, which are rigid bodies B; C R¢ for i € V. 
(2) Bars, which are line segments q;qj with q; € B; and q,; € B; for an edgee € E 
connecting vertices 7,7 € V. 
In a body-and-bar framework, G is allowed to be a multigraph with no loops. 


EXAMPLE 4.4. A well-known body-and-bar framework is the Stewart-Gough 
platform. Here is a picture from a paper by Bohigas, Maunbens and Ros [35]: 


The graph G consists of two vertices connected by six edges. One rigid body (the 
base) is fixed to ground. By varying the lengths of the bars (the six legs), the other 
rigid body (the platform) can change orientation. Stewart-Gough platforms have 
many applications, including flight simulators and radio telescopes. <— 


You will learn more about body-and-bar frameworks in Section [4.3] For now, 
we focus on the bar-and-joint case. Before giving a rigorous definition of rigidity, 
let’s give some examples that rely on an intuitive notion of rigidity. 


EXAMPLE 4.5. The rigidity of a framework can depend on the dimension of 
the ambient space. Consider the framework: 


This is rigid in R?, but in R°, the two triangles can be swung back and forth along 
the diagonal. <- 

EXAMPLE 4.6. A bar-and-joint framework whose graph is the complete graph 
K,, is rigid in all dimensions. For example, 


This framework has graph Ky and can be regarded as lying in R@ for any d > 2. It 
is clearly rigid. Note that bars are allowed to intersect. <- 
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EXAMPLE 4.7. Here is an example based on the four-bar mechanism studied 
in Example 2.15]in Section 2.2} This framework consists the quadrilateral ABCD 
with the triangle CDE attached to one edge. When we fix A and B and spin AD 
in a circle with center A, the point EF traces out the curve show above. We found 
its equation in (2.21). <P 


EXAMPLE 4.8. A generic Stewart-Gough platform from Example[4.4]with fired 
edge lengths is rigid. In this case, one can show that there are at most 40 platforms 
having the same lengths. See, for example, Example 6.3]. 

However, motion is possible in special cases. A Griffis-Duffy Type I platform is 
a Stewart-Gough platform where the base and upper body are equilateral triangles 
with legs attached at the vertices and midpoints of the edges as indicated here: 


With fixed edge lengths, there is now one degree of freedom, which gives the curve 
shown in red. This picture was created by Doug Arnold and Charles Wampler for 
the 2006-07 program on Applications of Algebraic Geometry at the Institute for 
Mathematics and its Applications. The image (41) also appears on the cover of 
the SIAM Journal on Applied Algebra and Geometry shown on page [95] << 
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Motion of a Framework. Before we can define rigidity, we need to understand 
what it means for a framework to move. For simplicity, we focus on bar-and-joint 
frameworks q = (qi)icv in R@ for the graph G = (V, E). 

Since we are only interested in motions that preserve the edge lengths ¢;; = 
lla: — q,|||? for ij € E, we have the following definition. 


DEFINITION 4.9. A motion of q = (qi)icv in R?@ is a differentiable family of 
frameworks q(t) = (qi(t))iev for 0 < t < 1 such that: 
(1) £:; = |las(¢) — a;(t)||? for all «7 € E and all t € (0, 1). 
(2) qi(0) = q; for allie V. 

However, some motions are boring. To define this precisely, we need to recall 


the isometry group of R?. 


DEFINITION 4.10. The Euclidean group E(d) consists of all isometries of R¢. 
The special Euclidean group SE(d) is the subgroup consisting of all orientation 
preserving isometries of R?. 


It is well known that any element of E(d) can be written v + Av + b, where 
A € O(n) is orthogonal and b € R¢ represents translation. This lies in SE(d) if and 
only if det(A) = 1, so that SE(d) C E(d) has index 2. Furthermore, 


2 2 


Subsets S,T C R¢@ are congruent if S = y(T) for some y € E(d). 
We now define a boring motion of a framework. 


(4.2) dim E(d) = dim SE(d) = dim SO(d) + dimR? = (5) id= (‘ Z ‘), 


DEFINITION 4.11. A motion (q;(t))iey of a framework is trivial if there is a 
differentiable map y : [0,1] > SE(d) with 7(0) = J, and 
gi(t) = (t) (ai) 
for alli € V. 


Flexibility and Local Rigidity. Any framework in R@ clearly has many trivial 
motions. But are there more? This leads to the following definition. 


DEFINITION 4.12. A framework in R?@ is flerible if it has a nontrivial motion. 
Otherwise the framework is locally rigid. 


Later in the section we will see why the adjective “locally” is relevant. We next 
give a lemma that is useful when thinking about local rigidity. 


LEMMA 4.13. Let q = (qi)icv be a framework in R? with graph G = (V,E). 
If G has a subgraph G’ = (V', E’) such that (qi)icy’ ts locally rigid with respect to 
the edges ij € E’, then the following are equivalent: 

1) The whole framework is locally rigid. 
2) Any motion of q firing q; for alli € V’ is trivial. 
Furthermore, if (qi)icv’ affinely spans R¢, then (1) and (2) are equivalent to: 
3) Any motion of q fixing q; for alli € V' fixes the whole framework. 
PROooF. (1) = (2) is obvious. For (2) = (1), suppose that we have a motion 


qi(t))iev of the whole framework. Since (qi);ev’ is locally rigid, there is a map 
7: [0,1] — SE(d) such that q;(t) = y(t)(q:) for i € V’. Then q;(t)) = y(t)~toqi(t) 


4.1. GEOMETRY OF RIGIDITY 141 


is a motion of the whole framework that fixes q; for i € V’. By (2), this is a trivial 
motion of the whole framework, and it follows easily that the original motion is 
also trivial. 

Finally, assume (q;(t))icy affinely spans R?. Elements of SE(d) preserve affine 
spans, so the only way y(t) € SE(d) can fix (qi)iev’ is to be the identity. From 
here, (2) = (3) follows easily, and then we are done since (3) = (2) is obvious. 


EXAMPLE 4.14. We claim that the following framework is locally rigid: 


F = (-2,2) 


D = (2,0) 
(4.3) 


E = (-2,-2) 


This is one of the frameworks from Example [4.2] 

The outer triangle is locally rigid since it comes from a complete graph. By 
Lemmaf4.13] we need only consider motions that fix D, E, F. Since motions preserve 
lengths, the original inner triangle ABC (shown in gray below) must rotate along 
the red circles to a new inner triangle A,B,C; (shown in blue): 


PS a 
rf TS» p 02.0) 


Cea 


Write Ay = (41,91), Bi = (x2, y2), Ci = (#3, y3). Then preserving the six lengths 
A,D, B\E,C\F and A,B, B,C, A,C shows that 2;, y; satisfy the six equations 


(21-2)? + yf =1, (w+2)? + (yo+2)? = 2, (@3+2)? + (ys—2)? = 2 
(1-22)? + (yi—Yy2)? = 5, (2-33)? + (yo—y3)? = 4, (w1—-23)? + (yi—ys)? = 5. 
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Eliminating y1, 72, y2,%3, yz, we see that x, satisfies the equation 
7743 — 565621 + 890x? — 1272x? — 2105a4 + 4002? = 0. 


This quintic has roots x, = 1, 5.8, —1.7697, 0.116098 + 1.368377. The only root that 
leads to real values of the other variables is x, = 1. Substituting 7; = 1 into a 
lexicographic Grébner basis and simplifying gives the polynomials 


(4.4) yp t2—-ywtlytytlastytily +y3—1. 
These make it easy to see that A; = A, B, = B, C, = C. Thus any motion fixing 
D,E, F also fixes A, B,C, and rigidity follows by Lemma [4.13] 

One interesting feature of is the polynomial y?, which from the point of 
view of scheme theory tells us that when 71 = 1, we have an infinitesimal solution 
in the y; direction. This is related to the failure of (44) be infinitesimally rigid, as 
we will see below. <> 


Global Rigidity. Here is the strongest notion of rigidity. 


DEFINITION 4.15. A framework q in R@ for G = (V, E) is globally rigid if any 
framework q in R? for G with the same edge lengths as q is congruent to q. 


Any framework for the complete graph K,, is globally rigid in any dimension, 
and it is also easy to show that global rigidity implies local rigidity. The converse 
fails because even if a framework has only trivial motions, it might be able to jump 
to something different. 


EXAMPLE 4.16. Consider the following frameworks in R?: 


(a) (b) 


Example [4.14] shows that framework (a) is locally rigid. The proof can be easily 
adapted to show global rigidity. On the other hand, the framework q in (b) is 
locally rigid in R?, but reflecting the upper left triangle about the diagonal (the 
“jump” ) gives the non-congruent framework q with the same underlying graph and 
edge lengths. So q is not globally rigid. << 


EXAMPLE 4.17. Examples[4.5)and [Z.6]use an intuitive idea of rigidity. We now 
see that Example [4.6Jis globally rigid in any dimension > 2 since its graph is Ky. 

On the other hand, Example [4.5] is locally rigid in R? but not globally rigid 
since the lower right triangle is a degenerate framework with the same graph and 
edge lengths. To see why this makes sense, consider q and q in Example [4.16{b). 
Deform the upper left vertex of q until the two triangles become congurent. As 
this happens, q deforms to a degenerate framework supported on a triangle. 

It follows that Example [4.5] and its lower right triangle are a limiting case of 
Example [4.16[b). This shows how degenerate frameworks can arise naturally. <> 
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Infinitesimal Rigidity. By definition, motions of a framework are differentiable. 
Their derivatives will lead to a new flavor of rigidity. 
Fix a framework q = (qi)icv in R¢@ with ¢;; = ||q; — qj||? for 7 € E. Given a 
motion {qi(t)}iev, its derivative at t = 0 is 
qi = “ ailt) 
Motions preserve length, so that ||q;(¢)—q,(t)||? = 4; holds for all t. Differentiating 
this equation and setting t = 0, we get the system equations 


(4.5) (ai —a5)-(Q;-93)=0, WEE, 
which are linear in the qj. 


t=0 


DEFINITION 4.18. A framework q is infinitesimally rigid if the only solutions 
(qi )iev of (4.5) are those coming from trivial motions of q. 


In 1979, Asimow and Roth §3] proved the following basic result. 
THEOREM 4.19. Infinitesimal rigidity implies local rigidity. 


In general, the converse of Theorem [4.19] can fail, as shown by the following 
familiar example. 


EXAMPLE 4.20. Recall the framework analyzed in Example [4.14] 
F = (-2,2) 


D = (2,0) 


E = (-2,-2) 


As before, we can fix D, E, F, so that D! = E’ = F’ =0. Then (4.5) becomes: 


(A—D)-A'=0 = A’ = ae 

(B-—E)-B'=0 => B' = Ble, —- e2) 

(C—F)-C'=0 = C'=7(e1 + €2) 
BoB) (i Ba) = Sp 
(A-—C)-(A’-C’)=0 = a=-y7 
(B-C).(B—C')=0 = B=-, 


where {e1, €2} is the standard basis of R?. The solution (a, 8,y) = (—1, -1, 1) gives 
A! = —e2, B' = —e; + €2,C’ =e; + €2, shown above in red. 

We conclude that this framework is locally rigid but not infinitesimally rigid. 
This happens because the lines AD BE ; GF intersect at a common point (shown 
above in gray). A generic framework on the same graph is infinitesimally rigid. <> 
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The Rigidity Matrix. A nice aspect of infinitesimal rigidity is that it involves a 
system of linear equations (4.5). The matrix of this system will play an important 
role in what follows. 

Suppose that the graph G = (V, E) has n = |V| vertices and m = |E| edges. A 
framework q = (qi)icv for G in R@ gives a linear map 


Te,q : (R*)"” — R™ 
that maps velocity vectors (qj,...q/,) € (R%)" to the dot products 
(..., (qa; — ay) : (dp Jie) oa ER”. 
The matrix Rgiq of Tg.q is the rigidity matrix of the framework q. Note that the 


null space of Rg.q consists of the solutions of the system (4.5). This leads to our 
first result. 


THEOREM 4.21. If q = (qj,---,Gn) affinely spans R¢, then: 
(1) rank(Re.q) < dn — eae ; 
(2) q is infinitesimally rigid if and only if rank(Re,q) = dn — ( Z ie 


PrRooF. The initial velocity vector of any motion of the framework gives an 
element of the kernel of Tg,g. This includes trivial motions, so that 


d+1 
nullity(Re.q) = dimker(T¢.q) > dim SE(d) = ( 7 i 
where the middle inequality holds because q affinely spans R?. Then (1) follows 
from nullity(Re,q)+rank(Re,q) = dim(R“)” = dn. For (2), recall that infinitesimal 
rigidity means that all solutions of (4.5) come from trivial motions. Hence 


1 
q is infinitesimally rigid <=> nullity(Re.q) = dim SE(d) = @ i; 


This translates to the rank condition in (2), and we are done. 
Since Rg,q has m = |E| rows, we get the following nice corollary. 


CoROLLARY 4.22. If q =(qi,.--,Qn) affinely spans R¢, then: 
(1) m<dn— (") if Ra,q has linearly independent rows. 
(2) m>dn— Ce) if q is infinitesimally rigid. 
ProoF. In the situation of (1), m = rank(Rgq), and then we are done by 


Theorem [21[1). In the situation of (2), we know that rank(R¢,q) = dn — (*5') 
by Theorem (4.212), which is bounded by the number of rows m of Raq. 


The appearance of “linearly independent rows” in Corollary [4.22] hints at the 
matroidal aspect of rigidity. We will explore this further in the next section. 


Other Flavors of Rigidity. So far, we seen three flavors of rigidity: local, global, 
and infinitesimal. There are also variants to consider: 


e Each of these has a minimal version, where the property fails if any edge is 
removed. 
e Each of these has a generic version. We will say more about this below. 


We will also learn about combinatorial rigidity in Section [4.2] 
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Algebraic Geometry. It is now time to explore the algebraic geometry of rigidity. 

We begin with the configuration variety, which for G = (V,E) parametrizes all 

frameworks in R¢ with given edge lengths. As before, we set n = |V| and m = |E}. 
From G, we get the length-squared map lc. : (R7)" + R™ defined by 


fg,a(a) = (lai — agll?)ien- 


This is a polynomial map since we are squaring the lengths. Then, given @ € R&, 
the configuration variety of G and £ is 


(4.6) Coas =f 2) S (R*)”. 


This is a real algebraic variety. If q is a framework for G in R?, then Ce.a.¢ for 
é = lg.a(q) consists of all frameworks for G with the same edge lengths as q. 
One easily shows that the Jacobian matrix of gq at q € (R“)” satisfies 


(4.7) $I3lc,a(a) = Reg: 


so that up to a constant, the rigidity matrix Rg¢,q computes the derivative of the 
length-squared map. This has the following nice consequence for the associated 
configuration variety. 


THEOREM 4.23. Let q be a framework for G in R4 with € = lg.a(q)- If Reg has 
rank m, then the configuration variety Ce,ae is smooth at q of dimension dn —m. 


Proor. Thinking of C¢.a,¢ as a real manifold, this follows from the Implicit 
Function Theorem since Jég.a(q) has rank m by (47%). Translated to algebraic 
geometry, the rank condition implies smoothness of the desired dimension. 


Here is a fun application. 


EXAMPLE 4.24. Let P C R® be a 3-dimensional polytope. The edges of P give 
a framework q in R? whose underlying graph will be denoted G. In 1978, Asimow 
and Roth used results of Cauchy and Gluck to prove the following theorem. 


THEOREM 4.25. The edge framework q is locally rigid for G in R? if and only 
if every face of P is a triangle. 


PROOF. First suppose that the faces of P are triangles. Since P is also convex, 
it follows that any small perturbation of the vertices gives a combinatorially equiv- 
alent polytope. Any motion q(t) starts from q, so that for small values of t, q(t) 
will be the edge framework of a polytope P; combinatorially equivalent to P. Since 
triangles are globally rigid, the corresponding faces of P and P, will be congruent. 
Then an 1813 theorem of Cauchy implies that P and P; are congruent, and 
local rigidity follows. 

For the other direction, assume that q is locally rigid in R® and set € = ¢.3(q). 
Ideas of Gluck from 1975 imply that Jé¢.3(q) has rank m since q comes from 
a convex polytope (see Theorem 6.3] for a careful proof). By Theorem [4.23] 
it follows that the configuration variety Cg,3,¢ has dimension 3n — m. 

The affine span of q is R® since dim(P) = 3. Hence the SE(3)-orbit of q is 
smooth of dimension dim(SE(3)) = eee = 6. Local rigidity implies that this orbit 
coincides with Cg 3,¢ near q. By the previous paragraph, we conclude that 


3n —m = 6, hence m = 3n— 6. 
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To see why this forces the faces of P to be triangles, we use Euler’s formula 
f-m+n=2, 


which connects the number f of faces of P to the number m of edges and number 
n of vertices. Then we obtain 


m = 3n —6 = 3(n — 2) = 3(m— f) = m+ (2m — 3f), 


which implies 2m — 3f = 0. However, in the polytope P, every edge meets exactly 
two faces and every face has at least three edges. This tells us that 2m — 3f > 0. 
The only way 2m — 3f > 0 can become 2m — 3f = 0 is for every face of P to have 
exactly three edges, i.e., a triangle. Thus P has triangular faces, as desired. 


The 1813 theorem of Cauchy mentioned above has an elegant proof that can 
be found in Proofs from THE BOOK [6]. > 


The Cayley-Menger Matrix. When G = K,, the image of the map 
(4.8) lx,a 2 (R9" + R@) 


can be described explicitly using equations and inequalities. We begin with an 
example of the equations involved. 


EXAMPLE 4.26. Three points q1,q2,q3 € R? determine a triangle whose area 
is given by Heron’s formula 
A= <s(s — a)(s — 6)(s — c) 


for edge lengths a = ||qi—qga||,b = |/qi—qs||,c = ||q2—q3|| and semiperimeter 
s=4(a+b-+c). After some algebra, one gets the equation 


0 i} 1 1 

1 0 lai—qe||?— ||a1—aa3||? 
—16A? = det 

1 |a2—a;||? 0 lla2—q||? 

if las—a;||? |as—q2l|? 0 


In particular, the determinant vanishes when the points q1, q2,q3 lie on the hori- 
zontal axis in R*. It follows that since 0,1 : (R')? > R? is defined by 


€x,1(41, 42, 43) = (|! aI, |lqi1 all”, |la2—asll7), ai, 42,43 € R'=R, 


the above determinant gives a polynomial that vanishes on the image of 0,1. <> 


Cayley’s first paper [69], published in 1841, generalized this result to higher 


dimensions. Suppose that we have n points q = (qi,---,@n) in d-dimensional space 
R?. Then we have the (n+ 1) x (n+ 1) Cayley-Menger matrix 
0 1 1 1 ve 1 
1 0 \|a.—a2||?_— |Jqax—as||?_ +» |lqa—an |? 
1 ||a2—a1 ||? 0 llq2—as||?_--- — (la2—anl|? 
CM(4) = }1 lqs—ail? las—aa? = 0 ls al? 
1 |lqn—anl|?— |lan—a2 ||? |lan—aasl|?_ ++ 0 


The n points q determine an (n — 1)-simplex in R? whose (n — 1)-dimensional 
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volume V,,_1 satisfies 
(—1)"2""1((n — 1)!)?V2_, = det CM(q) 


(see Section IV.40]). Example [4.26] is the special case when n = 3. 

The volume V,,_1 is positive if and only if qi,...,qQ» are affinely independent in 
R¢. This is impossible if d < n—1, so that det CM(q) = 0, ie., rank CM(q) < n+1 
in this case. In fact, the dimension d puts a strong upper bound on rank CM (q). 


THEOREM 4.27. For q € (R“)", the matrit CM(q) has rank < d+ 2, i.e., its 
(d+ 3) x (d+3) minors all vanish. 


PROOF. Given qj,...,dn € R@ and 2 < i,j <n, the law of cosines for the 
triangle with edges qi — qi, Gj — G1, Gi — Gj implies that 


(4.9) (ai — a1) - (Qj — a1) = 3 (las — all? + lay — all? — lla — ay”). 
These dot products form the Gram matrix of qo — q1,---,Qn — 41, denoted S1(q) 
(the subscript 1 indicates subtraction by qi). The matrix $;(q) is symmetric of 
size (n— 1) x (n—1). 
To relate CM(q) and 5;(q), let 

bij = |lai — all’, 1<ijcn 

sig = 9 (lin + fj — 43), 2S 4,5 
Note that bi = Cn and Sig = Sis and sig = (qi = qi) : (q; = qi) by (£9). This 
allows us to write the Gram matrix as $(q) = (sij)?j;22. Then 

—291(q) = (—28j)7 ja2 = (Cig — Cir — £15) j=2 


leads to the following row and column operations on CM (q): 


O 1 1 +. 1 ot Dee 0] 010 -.. 0 

LO &2 ++ ban 1 O big +++ bin 1 0 0 -::. O 

1 fy, O ++ bon} A }O boy B \0 0 

a. 2 fF se § a 4 ~251(q) 2} ~251(q) 

1 fri bne +++ 0 O ent 0 O 
We index the rows and columns of CM(q) by 0,1,...,n. The row and column 
operations “, subtract row and column 1 from rows and columns 2,...,n, and the 
row and column operations & subtract suitable multiples of row and column 0 from 
rows and columns 2,...,n. Row and column operations preserve rank, so that 
(4.10) rank CM (q) = 2+ rank S;(q). 


Since q2 — qi,---;Gn — qi € R@ implies rank $;(q) < d, the theorem follows. 


EXAMPLE 4.28. When n = 3 and d = 1, CM(q) is the 4 x 4 matrix shown in 
Example [4.26] Then det CM(q) = 0 in this case follows from Theorem [4.27] since 
d+3=4., <P 


The rank condition of Theorem |4.27] gives equations that vanish on the image 
of the map (4.8). However, there are also inequalities, since the Gram matrix S1(q) 
from (49) is positive semidefinite (PSD). Let’s use this to characterize the image 
of Lx, a: (R4)" > RG), We will use ¢;; = €;; for coordinates of R(2), so €x,.4 is 
defined by ;; = ||qi — qj||? for q € (R%)”. 
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THEOREM 4.29. A point ¢ = (l;) € R() is in the image of the length-squared 
map £x,,,a: (R4)" > R@) if and only if 


0 1 1 see 1 
1 0 big +s bin 

(1) rank 1 fay O +++ ban <d+2. 
1 Lat i) ea 0 


(2) The (n—1) x (n—1) matriz S = (Sig )Pj=2, si = 4 (Ci + bj, — ti), is PSD. 


PROOF. One direction follows from the above discussion. For the converse, let 
S' be PSD of size (n—1) x (n—1). Then (1) and (410) imply that rank S < d. Being 
PSD, § is the Gram matrix of n — 1 vectors in R"~!, which we denote qz,...,qn- 
Set qi = 0 € R™! and let W = Span(qz,...,qn). The rank condition on S$ implies 
that dim W < d, so we can regard q1,...,Qn as vectors in R?. From here, it follows 
easily that li = lq: = q;|l? for 1 < a] <n. 


The name “Cayley-Menger” was chosen to honor Cayley’s 1841 introduction 
of the matrix CM(q) and Menger’s 1931 characterization of the image of 
(x,,,aq using equations and inequalities [270]. The connection to PSD matrices via 
sig = $(Li1 + C51 _ tg) is due to Schoenberg in 1935 (309). 


The Cayley-Menger Variety. The equations and inequalities defining the image 
of fa: (R4)" > R() in Theorem G29 give equations that define the Zariski 


closure of the image in RG). T his remains true over C, which allows us to apply 
some classic results from algebraic geometry. 
The defining equations of the image are easily seen to be homogeneous in the 


coordinates ¢;; of c(2). Hence they define the Cayley-Menger variety 
CM xe r=, 
We compute the dimension and degree of CM y,4,c as follows. 
THEOREM 4.30. The Cayley-Menger variety CMn.a,c © P(2)-! has dimenion 
dim CMn,ac = dn — (“4") -1 


and degree n—d—2 ( n-1+6 ) 
deg CM na,c — if “Ea 
£=0 oa 


PRooF. We follow Borcea and Streinu [36]. The change of variables 84 = 
5 (Ei1 + bey = Py) in Theorem [4.29] transforms the image of fx, 4 into the set of 
(n — 1) x (n—1) symmetric PSD matrices of rank < d. Proposition 5.1.1 of 
implies that the Zariski closure is the variety 


Sym,_1.4aR = {(n—1)x(n—1) real symmetric matrices of rank < d}. 


This remains true over C, so that projectively, the change of variables induces 
is isomorphism of projective varieties 


(4.11) CMn,ac ~ PSym,_1,4,c> 


where PSym,,_ Cc p(2)-1 is the projective version of Sym,,_ Cc c@). 
yM,-1,4,C YMp—1,d,C 
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The variety PSym,,_; a,c is a classic example of a determinantal variety. Its 
dimension and degree were computed by Gambelli in 1905 and are given by 
the formulas in the statement of the theorem. Modern proofs are due Jésefiak, 
Lascoux and Pragacz in 1982 and Harris and Tu in 1984 [188]. 


Back to Frameworks. The Cayley-Menger variety CM,y,a,c is lovely, but what 
does it have to do with rigidity? The map @x, q¢ uses the complete graph K,,, whose 
frameworks are always rigid. How can this be interesting? 

What saves the day is that K, has lots of interesting subgraphs. In fact, any 
graph with n vertices is isomorphic to a subgraph of K,,. This simple observation 
has many consequences, including the rigidity matroid to be defined in Section [4.2] 
and a fun application of Cayley-Menger varieties due to Borcea and Streinu [36], 
which we will now explain. 

We first set up some standard notation: 


= {1,...,n} 
in) = ii} lige ln SAG} = {i | 4,5 € [n], iF 5}. 
Thus K,, has vertices [n] and edges 7 and an arbitrary graph G with n vertices 


can be represented as G = ((n],£) for E C (ml). In this notation, the length- 
squared maps for K,, and G fit together to give the commutative diagram 


(4.12) 


(R4)n Se" _, RG) 
(4.13) fa. | 


R™ 


where m = || and 7g is projection onto the coordinates of R() indexed by E. 
Given a framework q for G = ([n], E) in R?, we define the affine subspace 


Lg = 75 (Ec,a(a)) CRO). 


Using this, we can describe congruence classes of frameworks for G using Cayley- 
Menger varieties as follows. 


LEMMA 4.31. Let q be a framework for G = ([n], E) in R¢. Then: 


(1) If q is another framework for G in R¢ with the same edge lengths as q, then q 
is congruent to q if and only if €x,,,a(q@) = &x,,,a(q) 
(2) There is a bijection 


pee ie classes of frameworks for G 


A dyn 
in R¢@ with the same edge lengths as q } ata tx,,a(R ) ). 


ProoF. (1) follows because q is globally rigid when regarded as a framework 
for K,. For (2), note that if we have a framework q for G = ([n], E) in R¢, then 
q has the same edge lengths as q = > é¢,4(q) = lc,a(q) 
<= Tx(lK,,,a(4)) = 4c,a(4) 
<= ¢,,a(@) € ™ (Ee,a(4)) = La. 
Combining this with (1), we see that the desired bijection is given by 


congruence class of q+—> lx, a(q). 
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This lemma implies that counting congruence classes reduces to counting points 
in the intersection of the affine subspace Lg and the image ¢x,, a((R")”). However, 


e if we replace the image with its Zariski closure over C, and 
e if the affine subspace has complementary dimension, 


then the intersection would compute the degree of Zariski closure. In the projective 
setting, this would be the degree of the Cayley-Menger variety, which we know from 
Theorem [4.30} This is intriguing. Is there a theorem lurking here? 


Generic Minimally Rigid Frameworks. To flesh out the ideas in the previous 
paragraph, we first note that the homogeneous polynomials defining the Cayley- 


Menger variety CMy.a,c © p(2)-! also define its affine cone 


“CM n.4,C Cc c(2) ) 


iw) 
NV 
a) 
Qa 
— 
Ns 
~~ 


which is the Zariski closure of £x,,,a((R*)”) EC RG 
affine cone, Theorem [4.30] tells us that 


d41 
dim "OMnac = dn — ( - ) 


Translated to the 


It follows that in the previous paragraph, we want to use an affine subspace of 
complementary dimension, i.e., of codimension dn — ee” But in Lemma|4.31) Lg 
has codimension m = |E| since it is a fiber of the projection map 7 fz : RG) = R™. 
Hence we will consider graphs G = ([n], £) satisfying 


m =|E| =dn—- (“3'). 


For a framework q in R¢ for G, we make three further assumptions: 

e q=(qi,---,qn) affinely spans R?. 

e q is locally rigid for G. 

e The edge lengths gq are generic in R™. Below we explain what this means. 
Before stating our main theorem, we explore some consequences of these hypotheses. 
In Theorem [4.19] we noted that infinitesimal rigidity implies local rigidity. While 
the converse can fail, it holds generically, as we now explain. 


THEOREM 4.32. When the framework q is generic, infinitesimal rigidity is 
equivalent to local rigidity. 


Proor. In §3], Asimow and Roth prove that q is infinitesimally rigid for 
the graph G if and only if the Jacobian Jég.q has maximal rank at q and q is locally 
rigid for G. Then we are done since the rank condition is satisfied for generic q. 


LEMMA 4.33. The above hypotheses imply that q is minimally locally rigid for G. 


ProoF. If q is not minimally locally rigid for G, we could remove an edge 
to get a subgraph G"’ such that q would be locally rigid for G’. By the previous 
theorem, it would also be infinitesimally rigid for G’. Since G’ has |E|-—1=m-—1 
edges and q affinely spans R¢, Corollary 4.22] would imply that m—1 > dn— Gar 


— 


This is impossible since we are also assuming m = dn — ( 


Here is a 2004 theorem of Borcea and Streinu [36]. 
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THEOREM 4.34. For G and q as above, the number of congruence classes of 
frameworks for G in R¢ with the same edge lengths as q is bounded by 


in| n—-1+é ) 


II n—d—1-—£ 


wo Ce) 


PROOF. Since “CM, ac © c(2) is the Zariski closure of the image of the map 
lx,a: (R4)" > R(), (£13) gives the commutative diagram of varieties: 


K 


L d 
(C2)” ——"" 5°CM, ac 
laa 
(4.14) “, | i. 


Cc™ 


where @zg is the restriction of the projection c@) > C™ to *CM y,.a,c. Our hypothe- 
ses and Theorem[£-2]]imply that the rigidity matrix Rg. has rank dn— ees 
By (£7), Raq is the Jacobian of gq at q up to a factor of 4. It follows that lg q 
is dominating, which implies that dg in (4.14) is also dominating. Since 


d+1 
dim “CM yn a,c => dn — ( 3 ) =m, 


=m. 


we conclude that @z :°“CMn,ac — C™ has finite degree. Hence there is an integer 
D> 0 such that a generic fiber of dg has D points. We assume that q is generic 
in this sense, i.e., by (a) has D points. 

By Lemma [4.3i]and the above diagram, we obtain 


Gre eae classes of frameworks for G 


= d\n -1 
in R? with the same edge lengths as q } ~ Lg bx,,a((R)") S oe (a). 


When q is generic, there are at most D congruence classes of frameworks for G. 
To finish the proof, we need to show that D is the product in the statement 


of the theorem. Via Theorem [4.30] this reduces to proving that CM na C p(2)-1 
has degree D. To see why, note that dp : °CM pac + C™ is a projection of generic 
degree D between varieties of dimension m. This induces a rational map 


(4.15) Op: CMnac --> P™ 


of generic degree D between projective varieties of dimension m — 1, where ®zg is 
the restriction of the projection map 


He: p(2)-2 _-5 pm-1 
induced by mz: c(2)-! 5 ©”, It follows that for q generic 
(4.16) ®5 (a) = Me (a)N CM ac 


has D points. We will show that for generic q, the intersection has deg(CM y,4,c) 
points. The key idea is to consider three subvarieties of CM ya,c: 

e Points where ®z is not defined (the base locus). 

e Points where CM,,a,c is not smooth (the singular locus). 

e Points where ®g ramifies (the ramification locus). 
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These form a proper subvariety W C CM,a,c, hence the same is true for its image 

under ®z since ®g is a dominating rational map between varieties of the same 

dimension. Then q € (R%)” is generic when £¢,4(q) avoids the the image of W. 
Since we avoid the base locus, can be written 


®;'(q) = Il," (a) a CM n.,c, 


where It; (a) is a linear space of codimension m— 1. Since the intersection is 
finite, Bézout’s Theorem tells us that the intersection consists of deg(CM n.a,c) 
points, counted with multiplicity. But our choice of q guarantees that all points of 
the intersection occur at smooth points of CM,,,4;c where ®g is unramified. This 
means that all of the multiplicities are 1, so that intersection has deg(CMy,4,c) 
points. But it also has D points, which gives the desired equality. 


Recent Developments. The proof of Theorem[4.34]uses the affine Cayley-Menger 
variety “CM na.c C c(2) and the diagram (4.14), both of which have their origins 
in the commutative diagram 


(Ray —“* 5 RO) 
laya 
TE 
R™ 


In contrast, the two recent papers 


e by Capco, Gallet, Grasegger, Koutschan, Lubbes and Schicho (2017) 
° by Bartzos, Emiris, Legersky and Tsigaridas (2018) 


take a different point of view. They assume d = 2 or 3 and completely ignore Ky, 
focusing instead on the configuration varieties 


Ce,ae = €G'a(€) C (R*)” 


defined in (4.6). The group SE(d) acts on Ce.ae, and and find a way 
(different in each paper) to reduce to an explicit system of polynomial equations 
with finitely many solutions over R and C. The number of solutions over R gives the 
number of frameworks for G in R¢ with fixed edge lengths up to SE(d) equivalence, 
while the number over C is easier to deal with (both practically and theoretically) 
and gives an upper bound for the number of frameworks that is much better than 
the bound given in Theorem |4.34| These papers use a variety of tools, including 
tropical geometry, bigraphs and recursion in and mixed volumes and homotopy 
continuation in [16]. We expect further progress as people continue to study these 
wonderful problems. 


Final Comments. This section introduced various flavors of rigidity and explored 
the connections to algebraic geometry. However, to unlock the full power of rigidity 
theory, we need to bring combinatorics into the picture. This is our next task. 


4.2. Combinatorics of Rigidity 


The combinatorical aspects of rigidity are best understood using the language 
of matroids. A key player is the rigidity matroid, which has close relations to 
both the rigidity matrix Rx, q and the Cayley-Menger variety CMp.a,c. We will 
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explore some properties of this matroid and give a classic result about minimally 
locally rigid frameworks in the plane due to Laman and Pollaczek-Geiringer. Then 
algebraic statistics will make a surprise appearance. 


Matroids. We begin with two examples of matroids. 


EXAMPLE 4.35. Suppose we have a vector space V over a field & and a finite 
subset S = {v1,...,ue} C V. A basic idea in linear algebra is linear independence, 
which leads to the following collection of subsets of S: 


LT={T CS | T is linearly independent over k}. 


The collection Z has three interesting properties: 

ee. 

e UCT €T implies U € TZ. 

e U,T €T with |U| < |T| implies Ju; € T \ U such that UU {u;} € T. 
The first two properties are obvious, while the third, called the exchange property, 
takes a little thought. <- 


EXAMPLE 4.36. Suppose we have a field extension k C K and a finite subset 
S = {a1,...,ae} C K. A basic idea in field theory is algebraic independence, which 
leads to the following collection of subsets of S: 


T={T CS | T is algebraically independent over k}. 


The collection Z has three interesting properties: 

ee. 

e UCT €T implies U € TZ. 

e U,T €T with |U| < |T| implies Ja; € T \ U such that U U {a;} € T. 
The first two properties are obvious, while the third, called the exchange property, 
takes a little thought. <- 


The parallel language used in these examples is deliberate. In general, a matroid 
is a collection Z of subsets of a finite set S that satisfies the properties in the 
examples. We call S the ground set, and E C S is independent when E € T. 
Example [4.35] is called a linear matroid, while Example [4.36] is not surprisingly 
called an algebraic matroid. 

A nice introduction to matroids (with an emphasis on algebraic matroids) can 
be found in the expository paper [297]. A more complete treatment of matroid 
theory appears in the book Chapter 3]. All matroids considered here will be 
either linear or algebraic (sometimes both, up to isomorphism). 


The Rigidity Matroid, Definition 1. Given a framework q = (q1,..-,4n) € 
(R¢)” for the complete graph K,, in R?, the rigidity matrix Rx, 4 from Section 1] 
is the matrix that represents the linear map 


(qi,...qi,) € (R2)" +> ((as — ay) - (aj - 5) spe (im) € RG), 


where (1) is edge set of K,, defined in (412). To simplify notation, we write Rg 
instead of Rx, q- This leads to our first definition of the rigidity matroid. 


DEFINITION 4.37. Pick a point q € (R7)" whose coordinates are algebraically 
independent over Q. Then rigidity matroid Ra(n) is the linear matroid whose 
independent sets are the linearly independent rows of the rigidity matrix Rg. 
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If q is another such set of points, there is an automorphism o : R > R with 
o(q) = q. It follows easily that Ry and Rg give the same matroid. Thus Ra(n) is 
the linear matroid of the rows of the rigidity matrix of a framework for K,, in R@ 
that is as generic as possible. 


Since the rows of the rigidity matrix Rg are indexed by (2), we can use ey) 


as the ground set for the rigidity matroid Ry(n). Then a subset EC a) picks out 
a collection of rows in Rg and simultaneously determines a subgraph of K,,. This 
is our next topic. 


Edge Subgraphs of K,,. Take a subset E C (1) and let 
V(E) = {vertices of K,, incident to an edge in EF} C [nl]. 


We call G(E) = (V(E), FE) an edge subgraph of K,, since it is determined by its set 
of edges E. 


EXAMPLE 4.38. If n = 4 and E = {23,34} C (4), then V(E) = {2,3, 4}. 


2 3 
1 4 
The edge subgraph G(£) is in shown in gray with dashed edges. << 


Given FC (I) and a generic framework q € (R“)", we let qz be the elements 
of q indexed by V(E). It follows that qg is a generic framework for G(E) in R¢. 
This framework has a rigidity matrix Re:z),q, that relates to Rg as follows (we 
omit the straightforward proof). 


LEMMA 4.39. After suitably rearranging the columns of Rg, we have 
[rows of Rg indexed by E] = [Re(z),qz | 0 |. 


Thus Rg contains the rigidity matrices of all edge subgraphs G(E) of K,. But 
which of these correspond to independent sets in Ra(n)? 


Two Theorems. We next state two basic results about the independent sets 
E € Ra(n). We refer the reader to [317| Theorems 3.11.2 and 3.11.3] for proofs. 


THEOREM 4.40. If E is independent in Ra(n), then every subset F C E with 


|V(F)| > d satisfies |F| < d|V(F)|— lores 


THEOREM 4.41. Consider EC (8), Then: 
(1) If |V(E)| <d+1, then E is independent in Ra(n). 
(2) If |V(E)| > d+1, then any two of the following imply the third: 
e E is independent in Ra(n). 
e EF is locally rigid. 
© |E| =a\V()|— (%’). 


To understand the difference between |V(£)| < d+ 1 and |V(E)| > d+1, note 
that since q = (qi,---,Qn) is generic, the convex hull of q in R@ is a simplex when 
|\V(E)| < d+ 1, and independence is clear. But things can be more complicated 
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when |V(£)| >d+1. Here, Corollary 4.22] and Theorem [4.32] tell us: 


e Independence implies |E| < d|V(E)| — eo 


e Local rigidity implies |E| > d|V(E)| — (*4'). 
This shows that the the first two bullets in Theorem [4.41] imply the last. The 
pleasant surprise is that any two imply the third. There are many ideas involved 


here, but they interact very nicely. 


The Laman—Pollaczek-Geiringer Theorem. A classic result about the rigidity 
of frameworks in R? was discovered, forgotten, and rediscovered: 


e In 1927, Hilda Pollaczek-Geiringer published Uber die Gliederung ebener 
Fachwerke in Zeitschrift fir Angewandte Mathematik und Mechanik. 

e In 1970, Gerard Laman published On graphs and rigidity of plane skeletal 
structures in the Journal of Engineering Mathematics. 


Laman was unaware of Pollaczek-Geiringer’s earlier paper. Here is the main result 
proved in their papers. 


THEOREM 4.42. If G = (V,E) with |V| > 2, then a generic framework in R? 
for G is minimally locally rigid if and only if 


(4.17) |E| =2|V|—3 and |F| <2|V(F)|—3 for allOAFCE. 


Proor. The theorem is trivial when |V| = 2. We will prove one direction when 
|V| > 3. Suppose that q is generic and minimally locally rigid for G in R?. By 
Theorem [4.32] it is infinitesimally rigid, so that by Theorem [4.27] 


|E| > rank(Re.q) = 2|V| — 3. 


Rows of the rigidity matrix are indexed by edges, so 2|V| — 3 linearly independent 
rows give the rigidity matrix of a subgraph of G. By Theorem [42]] this subgraph 
is infinitesimally rigid, hence locally rigid. Since q is minimal for G, the subgraph 
must equal G, which implies |E| = 2|V| — 3. It follows that E is independent, and 
then subsets 0 4 F C E satisfy (E17) by Theorem [4.40] 

For the other direction, one can give a matroidal proof Theorem 4.2.1]. 
Another approach uses a recursive argument based on Henneberg moves. See, for 
example, Appendix Al. 


For many years, Theorem |4.42] was known as “Laman’s Theorem,” but now 
“Laman—Pollaczek-Geiringer Theorem” is more common. As noted in [16], James 
Clerk Maxwell knew part of the theorem in 1864, when he wrote: 


A frame of s points in a plane requires in general 2s — 3 connecting lines to render 

it stiff. 
The Rigidity Matroid, Definition 2. We next present Ra(n) as a algebraic 
matroid using the affine Cayley-Menger variety °CMy,4,c © C(2). Recall that this 
is the Zariski closure of the image of the length-squared map ¢x,, 4: (R“)" > RO), 

Using coordinates ¢;;, 77 € (2), on c), °CM a,c is defined by a prime ideal 
Rak = Cley | We (I). Its field of rational functions is R/Ppa @ K"?. 
Thus K"¢ is generated over C by the cosets of the @;; in R/ Pha. 

We now give a second definition of Ra(n). 
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DEFINITION 4.43. The rigidity matroid Ra(n) is the algebraic matroid whose 
independent sets are subsets E’ C ) such that either of the following equivalent 
conditions holds: 


(1) The @,, i7 € E, are algebraically independent over C as rational functions on 
°CM n.4,C; or 
(2) PraNC[éy | 7 € E] = {0}. 


The equivalence follows because (2) gives C[é;; | ij € E] @ R/Pnig A K™4. 
As a consequence, we can compute the independent sets of Ra(n) by elimination 
theory (though this is wildly impractical except in very simple examples). 


Equivalence of Definitions 1 and 2. Here is the theorem we expect. 


THEOREM 4.44. The linear matroid definition of Ra(n) via the rigidity matrix 
Rg is equivalent to the algebraic matroid definition of Ra(n) via the Cayley-Menger 
vartety °CM na,c- 


ProoF. The construction of "CM a,c implies that the complex version of the 
length-squared map 


lx.a i (C4)” — *OMnae C c) 


is dominating. At a generic point q € (C?%)", the matrix of $JlK,,,a(4) is the 
rigidity matrix Rg used in the linear matroid definition of Ra(n). From here, the 
desired equivalence follows without difficulty. 


Earlier, we used the linear matroid interpretation of Ra(n) in our discussion 
of the Laman-—Pollaczek-Geiringer Theorem. We next give an application of the 
algebraic matroid interpretation of Ra(n) to algebraic statistics, following Gross 
and Sullivant [180]. But first, we need to review some basic statistics. 


Gaussian Normal Distributions. The statistical models encountered so far in 
this book have been discrete models. We now turn to a class of continuous models 
based on the Gaussian normal distribution N (0,07), where the probability that a 
random variable X € R lies the interval [a, b] is given by the classic formula 


1 
e 
V210 
_ 


—- 
density function 


1,27 2 
—g2°/o daz 


pasxsy= 


We generalize this to n dimensions as follows: 


e X = (Xj,...,X,) is a random variable in R” distributed according to 
N(0,™), the multivariable Gaussian normal distribution with mean 0 € R” 
and covariance matrix X. 

e Nisan xn symmetric positive definite matrix (when n = 1,¥ = (07)). 
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e The density function for x € R” is 


= 1 x 1,ty-l 
ie) = Gayracrs P| gees a) 


The covariance matrix © is the parameter that defines the model, and its inverse 
=~! is usually called the concentration matriz. 


Gaussian Graphical Models. Suppose that we have a graph G = (V,£) with 
V = [n] as usual. 


DEFINITION 4.45. A Gaussian graphical model with graph G is a Gaussian 
normal distribution N(0, ©) whose concentration matrix U7! satisfies 


(4.18) (o-') =0 fori Aj and ij ¢ EL. 


The vanishing condition says that when 7 4 j and ij ¢ E, the random variables 
X; and X; are conditionally independent relative to the remaining variables. 


Maximum Likelihood Estimates. In the discrete statistical models considered 
in Sections2.2]andB.1] the maximum likelihood estimate played an important role. 
Recall that the MLE is (roughly speaking) the model that best explains the data. 

To apply this idea to a Gaussian normal distribution M’(0, ©), suppose we have 
observations X),..., X( € R” with mean 0. Then the maximum likelihood esti- 
mate is the choice © of the parameter © that maximizes the log-likelihood function 
for the given data. We explain this as follows. The data gives the sample covariance 
matrix 


d 
(4.19) S= sy) xXO(xOy, 
i=l 


This is a positive semidefinite n x n symmetric matrix of rank < d since it is the 
sum of d rank 1 matrices X(X)*, In terms of S$, the log-likelihood is 


— log det(=) — tr(S=~*). 


Now suppose that our model is unconstrained, i.e., © is allowed to be any n x n 
positive definite matrix. If in addition d > n, i.e., if the number of observations 
is at least the dimension, then with probability 1, the sample covariance matrix 
S has rank n and hence is positive definite. In this situation, one can show that 
»% = S maximizes the log-likelihood function. Thus, in the unconstrained setting, 
the MLE exists with probability 1 when d > n. 

However, when we are dealing with a Gaussian graphical model for G = (V, £), 
the covariance matrix ¥ is constrained to satisfy (©~');; = 0 when i 4 j and ij ¢ E. 
Hence the above analysis no longer applies. 


The MLE of a Gaussian Graphical Model. Given G = (V, F) and observations 
XM,...,X© € R”, the restriction on U~! means that in the sample covariance 
matrix S from (£19), only the entries $;; with i = j or ij € E have any meaning. 
This leads to the partial matrix 


Sa = (Sy )i=j or Hen: 
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EXAMPLE 4.46. Here is a graph G and possible sample covariance matrix S: 
2? -9 
97 
1 9 
9 1 
The question marks indicate numbers that mean nothing for the model. They 


correspond to the diagonals of the square, which are not edges of the graph. We 
obtain Sg by removing these entries. << 


to 
Ei 
& w 

WD 

II 
‘iiceou 
ie — 
~oOrR wo 


Using partial matrices, we can describe when the MLE exists as follows. 


THEOREM 4.47. In the Gaussian graphical model for G = ([n], E) and data 
with sample covariance matrix S, the MLE exists if and only if there is a positive 
definite % satisfying Ue = Se. 


We learned about this standard result from Uhler’s 2012 paper Geometry of 
mazimum likelihood estimation in Gaussian graphical models Theorem 2.1]. 
The matrix © in the statement of the theorem is not unique—to get the MLE ys, 
let © be the © with maximal determinant that satisfies Ng = Sg. One can prove 
that this maximum exists, is positive definite, and satisfies (4.18). 

When d > n, S is positive definite with probability 1 and we can take & = $ 
in Theorem |4.47]in this case. Hence we have proved the following corollary. 


COROLLARY 4.48. In the Gaussian graphical model for G = ([n], E), the MLE 
exists with probability 1 for d observations when d > n. 


Gaussian Graphical Models and Rigidity. The theory of Gaussian graphical 
models is very nice, but how does it relate to rigidity? One unexpected connection 
is the following theorem of Gross and Sullivant from 2018 [180] Theorem 1.2]. 


THEOREM 4.49. In the Gaussian graphical model for G = ({n], E), the MLE 
exists with probability 1 for d observations when E is an independent set in the 
rigidity matroid Ra_1(n). 


This theorem is really a result about matroids. In a Gaussian graphical model, 
we have d observations in n-dimensional space, while Rg_1(n) is determined by the 
rigidity matrix of a framework with n points in (d— 1)-dimensional space. In what 
follows, we first discuss a theorem of Uhler that relates existence of the MLE 
to an algebraic matroid built from symmetric matrices and then explain how Gross 
and Sullivant relate this matroid to Rg_1(n). 


The Symmetric Matroid and Uhler’s Theorem. By (419), d observations 
xX®,...,X €R” give the n x n sample covariance matrix 


d 
1 : 
= 5) xO(x) 
s d 2 es 


which is symmetric positive semidefinite of rank < d. This gives an affine variety 


over k = Q, R or C as follows. Introduce Ce) variables x;; for 1 <i<j<non 


the affine space BCE), and let 


Ln ,d,k Cc kay | 1 < a <j < n| 
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be the ideal of generated by all (d+ 1)x(d+ 1) minors of the n x n symmetric 
matrix built from the x4: 


T11  %12  %130«°**~— Lin 
T12 22 23 *** Lan 
T%13 %23 %33 *'*  U3n 
Lin T2n X3n a Inn 


It is well-known that I;,,¢,, is a prime ideal. 
We will consistently abuse notation and regard n x n symmetric matrices as 
n+1 
elements of k("2"). Under this identification, the affine variety 
n+1 
Symy ak = VUn,an) © Ka") 
consists of all n x n symmetric matrices of rank < d with entries in k. In practice, 
k = Q for symbolic computation, k = R for many applications, and k = C for 
algebraic geometry. 
The variables x;; give rational functions on the irreducible variety Symg ,, ;,- 
This is exactly the information needed to define an algebraic matroid. 


DEFINITION 4.50. The symmetric matroid Sq(n) is the algebraic matroid of 
the variety Sym,,,, relative to the rational functions 2;;. 


This matroid is independent of k, which justifies the notation Sg(n). In 2012, 
Uhler proved the following [849] Theorem 3.3]. 


THEOREM 4.51. For G = (V,E) with V = [n], the MLE exists with probability 1 
for d observations in a Gaussian graphical model N(0,%) for G when either of the 
following equivalent conditions are satisfied: 


(1) Inak Vk[tu, ry |tEV, 4 € E] = {0}, or 
(2) {tu, ty |t€ V, y € E} is an independent set in the symmetric matroid Sa(n). 
PROOF. The equivalence of (1) and (2) follows because (1) gives the inclusion 
Kilt, ty |t€ Vi € BE] —> klatu, ty|/Inax- 


The variables x;; for 7 € V and x, for 7j € EF correspond precisely to the entries of 
the partial matrix Sg. 

When k = R, these variables give coordinates on RY+, and the map sending 
a symmetric matrix S' to its partial matrix S¢ is represented by the projection 

TG: RC?) —3 RV+E, 
n+1 

We can interpret Theorem [4.49] in terms of 7g as follows. Let PD, C R("2') be 
the cone of n X n symmetric positive definite matrices. Its image under 7q@ is the 
cone of sufficient statistics 


Ca = ta(PD,) Cc RVt. 


For a sample covariance matrix S$, the condition in Theorem [4.49] that Sg = U¢ 
for some © € PD,, is equivalent to saying 


Sa € Ca. 
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We now prove the theorem. Since the MLE exists with probability 1 when 
d > n (Corollary (4.48), we can assume d < n. Our proof will use standard facts 
about semialgebraic sets. A good reference is {93}. 

When S comes from d observations, it lies in the cone PSD,,4 of positive 
semidefinite matrices of rank < d, so that 


Sgce€V= ™a(PSDn,a) ce RY+, 


Since PSD,,,¢ is semialgebraic (defined by polynomial equalities and inequalities), 
the same is true for V by the Tarski-Seidenberg Theorem Corollary 2.4]. 
Let’s study how Y interacts with Cg. Since PSDy¢ C PD, we have 


VC ma(PD») € ta(PDn) = Ca =CGU We. 
Here, (---) is topological closure and 0(---) is topological boundary. Hence 
(4.20) V=(VNCe)UVN We). 


This union is disjoint since Cg is open (being a projection of the open cone PD,,). 

Our hypothesis involves the ideal J, ¢,2 defining Sym,,_ dR: The relation between 
this variety and PSD,,,a is easy to describe: we have an inclusion PSDpja © Sym,,, aR 
and Proposition 5.1.1] easily implies that 


(4.21) PSD, is Zariski dense in Sym, q.p- 


This is what happens in R("2'). But what about in RV+#? We claim that 
(4.22) V is Zariski dense in RYT”, 


To see why, suppose that f € Ri[xy,v | i € V,i7 € E] vanishes on VY. Then ef 


regarded as a polynomial function on RCo ), vanishes on PSD»,a, which by (4 
implies that f vanishes on Sym,, 7.2. But Sym, ap is Zariski dense in oe so 
the Nullstellensatz implies that f € In,a,c since In,a,c is prime. Thus 


feiacn Canty |\te Vy € 2) = {0}, 


where the last equality is hypothesis (1) of the theorem with & = C. This completes 
the proof of (422). 

Let N = |V| + |E| be the dimension of RY+”. Since a ha ee set and 
its Zariski closure a the same dimension [93] Theorem 3.20], 2) implies that 
dim V = N. By [93] Proposition 3.15], we conclude that the Nee interior V° 
is a nonempty open set of RY +”. 

The final fact we need is that since Cg is open and semialgebraic in RY+”, 
its boundary OCg@ is semialgebraic of dimension < N by Corollary 2.5 and 
Proposition 3.16]. 

The theorem now follows. The previous two paragraphs imply that V° C 0Cg 
is impossible, so that by (420), V° MN Cg is a nonempty open subset of RY+”. It 
follows that for Lebesgue measure on RY+”, Y Cg has positive measure while 
YVM0Cc measure zero since dim0Cg < N. By (£20), the partial matrix Sg € V 
lies in Cg with probability 1, and the theorem is proved. 


The intersection In,4,c NC[xy, vy |i € V, ij € E] is an example of an elimination 
ideal. We saw in Section[L.JJhow such ideals were present at the birth of elimination 
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theory and are central to the modern approach [104] Chapter 3]. One fun aspect 
of the above proof is how this classical idea interacts with semialgebraic geometry. 


Back to the Rigidity Matroid. The symmetric matroid Sa(n) is nice, but our 
real object of study is the rigidity matroid Ra(n). The relation between these 
matroids was discovered by Gross and Sullivant in 2014. The key idea is to remove 
the diagonal. 

By definition, Sy(n) is the algebraic matroid of Sym,(n) relative to the xj; for 
1<i<j<n. One can show that 

Diag = {24 | 7 € [n]} 

is independent in Sg(n). The contraction matroid Sq(n)/Diag has ground set 2; 
for 1 <i<j<n,and given AC {ay |1<i<j <n}, we have 
(4.23) A is independent in Sq(n)/Diag <= Diag U A is independent in Sq(n). 
Here is a key result proved by Gross and Sullivant [180] Theorem 2.4]. We refer to 
their paper for the proof. 

THEOREM 4.52. The obvious bijection {xj |1<i<j<n}o (i) induces an 
isomorphism of matroids Sq(n)/Diag ~ Ra_-1(n). 

We now prove the theorem of Gross and Sullivant stated earlier in the section 
as Theorem [4.49} 

E independent in Rg_i(n) = > {x | ij € E} independent in Sq(n)/Diag 
<> DiagU {x, | 7 € E} independent in Sy(n) 
<=> for G = ([n], E), the MLE exists with 

probability 1 for d observations. 


The first equivalence is Theorem [4.52] the second is (4.23), and finally the third is 
Theorem [4.51] 


REMARK 4.53. Theorem [4.52] is not the first relation we have seen between 
rigidity and symmetric matrices. Recall that the proof of Theorem [4.34] used the 
isomorphism (411). At the level of affine varieties, the isomorphism becomes 


(4.24) 


Sym,_14c 2% “CMn,ac 


(4.25) t t 

Sa(n = 1) Ra(n) 
In this display, the bottom row indicates the associated algebraic matroids. At first 
glance, this seems incompatible with 


Sa(n)/Diag ~ Ra_i(n) 
from Theorem [4.52] What happens is that the isomorphism (4.25) of varieties 
uses the change of variables s;; = $(Li1 t= ta), which doesn’t preserve the 
ground sets of the two algebraic matroids. In other words, (4.25) does not induce an 


isomorphism of the associated matroids. This is a nice illustration of the importance 
of being able work directly at the level of matroids as in Theorem [4.52] 


Maximum Likelihood Estimate of a Planar Graph. As an application of the 
link (4.24) between MLE and the rigidity matroid, we conclude this section with 
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a surprising result of Gross and Sullivant about the MLE of a Gaussian graphical 
model of a planar graph {180} Corollary 3.11]. 


THEOREM 4.54. Let G = ([n],E) be a planar graph. Then the MLE of the 
Gaussian graphical model for G exists with probability 1 for d = 4 observations. 


Proor. If d = 4 > n, then we are done by Corollary [4.48] Hence we may 
assume n > 4. By adding edges between existing vertices, we get a maximal planar 
graph G' = ([n], £’) with E C E’. We will prove that E’ is independent in the 
rigidity matroid R3(n). Then EF will be independent, which by for d = 4 will 
imply that the MLE for G exists with probability 1 for 4 observations. 

Here are some standard facts about the maximal planar graph G’ = ((n], E’): 


e When regarded as lying in the sphere, maximality implies that every face of 
G’ is a triangle. Then the number of faces f’ and number of edges m’ = |E’| 
satisfy 3f’ = 2m’. 

e Combining this with Euler’s formula n —m’ + f’ = 2, we see that G’ satisfies 
m’ = 3n—6. 

e Maximality also implies that G’ is 3-connected when n > 4, meaning that 
it remains connected after removing any two vertices. See Corollary 
4.4.7] for a proof. 


A 1916 theorem of Steinitz (see Theorem 4.1]) states that every 3-connected 
planar graph can be realized in R? as the edges of a convex polytope. When applied 
to G’, the resulting polytope has triangular faces, which allows us to assume that 
the vertices are generic. Theorem [4.25]implies that the resulting edge framework q 
in R? for G’ is locally rigid. Since it also satisfies the m’ = 3n — 6, Theorem [Z.4]] 
implies that E’ is independent in the rigidity matroid R3(n), as desired. 


To see the power of this theorem, suppose that we have a Gaussian graphical 
model for a planar graph with 10,000 vertices. How many observations d do we 
need for the MLE to exist with probability 1? By Corollary [4.48] we know that 
d = 10,000 will suffice. If instead we use Theorem [4.54] we can let d = 4! 


Final Comments. It can take a while to get used to the language of matroids. Yet 
in this section we have used matroids to link a variety of topics, including rigidity, 
polytopes, real algebraic geometry, planar graphs, and algebraic statistics. 


4.3. Polynomial Methods and Rigidity Theory 


by Jessica Sidman 


In Section [4.2] we saw the Laman—Pollaczek-Geiringer (LPG) theorem, which 
says that in the plane generic rigidity has a combinatorial characterization. Three 
questions follow naturally: 


(1) Can the LPG theorem be generalized to higher dimensions, in particular to 
dimension three where there are potential applications to chemistry, engi- 
neering, and architecture? 

(2) The LPG theorem deals with frameworks that are given by a system of 
distance constraints between pairs of points. Are there similar statements 
for other kinds of constraint systems? 
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(3) What can we say about when genericity fails? In particular, can we use alge- 
braic methods to find intuitive geometric conditions under which genericity 
fails? 


Rigidity Theory in Higher Dimensions. Let us begin by briefly recalling the 
intuition behind the sparsity conditions in the LPG theorem which holds for a graph 
G = (V, E) in the plane where 

(1) [Z| = 2|V|—3 

(2) if V’ C V has at least two elements and E’ is the edge set of the induced 

graph on V’, then |E’| < 2|V’| — 3. 

Each vertex in the plane has two degrees of freedom, and constraints between joints 
can only remove relative motions. The 3-dimensional space of rigid motions of the 
framework as a single block can never be eliminated by adding bars between joints. 
Hence, if the edges in G impose independent conditions there can be at most 2|V|—3 
of them. 

The group of rigid motions of R? has dimension ea so in dimension three 
the space of rigid motions has dimension six. Since each vertex in a graph G has 
three degrees of freedom in space, the natural generalization of the conditions in 
the LPG theorem (also referred to as the Mazwell counting conditions) in R? is 

(1) |E| = 3|V|—6 
(2) if V’ C V has at least two elements, and E’ is the edge set of the induced 
graph on V’, then |E’| < 3|V’| — 6. 
However, the famous “double banana” shows that the Maxwell counting conditions 
do not characterize rigidity in dimension three. 


EXAMPLE 4.55. The double banana depicted below is a graph with 8 vertices 
and 3-8—6 = 18 edges. Each “banana” is the union of two tetrahedra glued along 
a facet. The two bananas are glued to each other at the top and bottom vertices. 

The reader may check that the Maxwell counting conditions are satisfied on all 
induced subgraphs. However, we may hold one banana fixed and rotate the other 
around the dashed line through the vertices at which the bananas are joined. This 
gives a nontrivial motion of the joints in one banana relative to the joints in the 
other one. 


1 << 


The analogue of the Maxwell counting conditions fails in higher dimensions as 
well. For each odd dimension d there exists a graph G with |E| = d|V| — (a) and 
|E"| < d\V’| — 3(“4') for all induced subgraphs that fails to be rigid. Examples of 
such graphs can be constructed as “hyperbananas” as in [80]. 


Question: Is there a combinatorial characterization of generic rigidity in dimension 
three? In higher dimensions? 
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An Introduction to Body-Based Rigidity. Though a combinatorial charac- 
terization of generic rigidity for bar-and-joint frameworks in dimension three and 
higher remains open, results characterizing generic rigidity exist for other types of 
frameworks, in particular body-based frameworks, which we introduce here. 

In the 1980s a combinatorial characterization of rigidity for body-and-bar frame- 
works was given by Tay [842]. A second proof was given by White and Whiteley 
361], who also laid out a way to analyze the combinatorial substructures of such 
frameworks using polynomials. To motivate the definition, consider Example [4.56] 


EXAMPLE 4.56. Let A and B be two polygons in the plane. In the figure below 
A is the triangle and B is the square. We fix A and analyze relative motions. 


= a a 
. i: a 2: a——ee 
If we attach a bar to both Adding a second bar Generically adding the 
A and B, then B can between A and B leaves third bar creates a 
rotate about its point of a shearing motion. rigid structure. 
attachment to the bar and 
also translate in space. > 


In general, we want a mathematical model that describes a situation in which 
we have n d-dimensional rigid bodies connected by a collection of m fixed length 
bars attached at universal joints. In the plane, we imagine the joints as pins about 
which the bodies can rotate. As Example [4.56] shows, in order to eliminate the 
relative motions between just two rigid bodies we will need multiple bars. Hence, it 
is natural to give a body-and-bar framework the underlying combinatorial structure 
of a multigraph. We reprise Defintion from Section in a form commonly 
found in the literature. 


DEFINITION 4.57 ([361]). A d-dimensional body-and-bar framework consists of 
a multigraph G = (V, £) and an element (a¢,b.) € IR? x R®@ for each edge e € E. 
Each vertex represents a body, and each edge e represents a bar with endpoints a¢ 
and be. 


In dimension 3, a famous example of such a structure is a special case of the 
Stewart-Gough mechanism which consists of two metal plates connected by six legs 
attached at ball-and-socket joints. In the full Stewart-Gough platform, the bars 
have adjustable length, but here we consider them fixed so that they represent 
fixed distances between joints. See Examples [4.4] and [4.8] from Section [£1] 

The conditions characterizing generic body-and-bar rigidity given by Tay and 
White and Whiteley are similar to those appearing in the LPG theorem. Again, the 
proof involves linearizing the problem, so the statement is for infinitesimal rigidity. 
Moreover, as we see in Example[4.58] any combinatorial characterization of rigidity 
can only hold for a sufficiently general framework associated to a given multigraph. 


EXAMPLE 4.58. Three bars with sufficiently general attachment points on two 
rigid bodies in the plane eliminate all relative motion. However, as we see below, 
there are special placements of the three bars that allow internal motion of the 
resulting framework. 
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<. —— If the three bars meet at a point, then 
< the square can rotate around the cube. 
ee a If the three bars are parallel, then the 
CS + system admits a shearing motion. <> 


We will return later to a discussion analyzing the geometry of the realizations 
of a framework that give rise to special behaviors, such as Example [4.20] from Sec- 
tion For now it suffices to define what we mean by “generic.” In a 
framework is generic if all of the coordinates of the joints are algebraically inde- 
pendent over Q. Indeed, then we may treat these coordinates as indeterminates (or 
variables), which we denote by x. 

Like the proof of the LPG theorem, the combinatorial characterization of body- 
and-bar rigidity depends on the development of a rigidity matrix, Rg, and is stated 
in terms of infinitesimal motions. 

Let k = () be the dimension of the group of Euclidean motions. We embed 
R? C P4 via (pi,...,pa)  (1,p1,---,;pa) in the usual way and abuse notation 
letting ae = (1,a-). In the linearization of the body-and-bar constraint equations 
given by White and Whiteley, the vector p. of the Pliicker coordinates of the line 
joining ae and be replaces the displacement vectors in the rigidity matrix of the 
bar-and-joint frameworks. 

Given a body-and-bar framework G(p) with n vertices and m edges we form 
an m xX kn rigidity matrix. If e has endpoints 7 and 7 with 7 < j, then the row 
corresponding to e has the form 


(0---O pe 0--- —pe 0---0) 


with pe in the k columns associated to vertex i, —p- in the k columns associated to 
vertex j, and zeroes in all other columns. The elements of the kernel of the rigidity 
matrix are precisely the infinitesimal motions of the framework. 

White and Whiteley work in the more general setting of a k-frame. 


DEFINITION 4.59 (Definition 1.7 in [B61]). A k-frame G(p) consists of a multi- 
graph G = (V,£) and a vector p, in R* for each edge e € E. We let G(x) denote 
the generic k-frame associated to the graph G and interpret x as a vector of inde- 
terminates. 


Note that every body-and-bar framework is a k-frame in which the edge labels 
are Pliicker coordinates of the lines along the bars of the framework. However, not 
all k-frames arise as body-and-bar frameworks. 

We will focus our attention on the generically minimally rigid case where m = 
kn—k. As with bar-and-joint frameworks, we cannot eliminate the “trivial” motions 
of the framework viewed as a rigid body by inserting edges—these edges can only 
eliminate relative motions between joints. We eliminate trivial motions by tying 
down a body, say body one. In terms of the rigidity matrix we eliminate the 
infinitesimal motions associated to body one by adding k rows containing the k x k 
identity matrix J, in the first k columns and zeroes elsewhere. We denote the 
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square matrix with tie-down by Ra,r: 


hors 0 eee Xe cee =e cee 0 


DEFINITION 4.60. A k-frame G(p) is generically infinitesimally rigid if the ker- 
nel of the matrix associated to G(x) contains only the trivial infinitesimal motions. 
It is minimally infinitesimally rigid if the deletion of any edge of G produces a 
nontrivial infinitesimal motion. 


THEOREM 4.61 ([342], ). A generic body-and-bar framework associated to 
a multigraph G(V, E) is minimally infinitesimally rigid in R® if and only if 


(1) |B] = k|V| —k 

(2) If V’ C V is nonempty and E’ C E is the set of edges induced on V’, then 
|E"| < k|V"|—k, 

where k = eae 


DEFINITION 4.62 ( ). Let G be a d-dimensional k-frame with kn—k edges. 
We define C¢(x) = det Re,r to be the pure condition of G. 


It is clear that if G(p) is a k-frame with kn — k edges, then it is minimally 
infinitesimally rigid if and only if the determinant of Rg,r is nonzero. Moreover, 
the pure condition Cg@(x) associated to G(x) is a nonzero polynomial if and only 
if G is minimally infinitesimally rigid. And, if G is generically minimally infinites- 
imally rigid i.e., Cg(x) is nonzero, then G(p) is minimally infinitesimally rigid for 
almost all p because the variety V(Cc) has measure zero. Theorem [4.61] gives a 
combinatorial characterization of this phenomenon. 


THEOREM 4.63 ((842],[361]). A generic body-and bar framework associated to 
a multigraph G(V,E) is minimally infinitesimally rigid in R@ if and only if G can 
be decomposed as a union of k edge-disjoint spanning trees where k = Ce i 

Note that a tree on n vertices has n — 1 edges, so a graph that is the union 
of k edge-disjoint spanning trees has kn — k edges. Hence, we see that if a graph 
has a decomposition into spanning trees as in Theorem [4.63] the first condition in 
Theorem [4.61] is satisfied. 


EXAMPLE 4.64. The simplest nontrivial example of a generically minimally 
rigid body-and-bar framework with three bodies in the plane is given in Example 
3.1 in [B61]. A planar body-and-bar framework is a 3-frame, and we see that if we 
have two edges between each of the three bodies, the edges can be partitioned into 
three disjoint spanning trees (indicated below by dashed, dotted, and solid lines). 
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We also give the rigidity matrix associated to G with vertex two tied down: 


a, a2 a3 | —-ay, =09 —03 0 0 0 
by bg bg | —by —bg —b3 | O 0 0 
Cy C2 C3 0 0 0 —Cy —Cy —C3 
d; dz dg] 0 0 0 |—-d; —dzg —d3 
Rar = 0 0 0 E1 €2 €3 —E€1 —€2 —€3 
D0 OF) Fi Je Fel fe —Fe 
0 O O 1 0 0 0 0 0 
0 O O 0 1 0 0 0 0 
0 O O 0 0 iL 0 0 0 


We compute Cg by expanding the determinant via 3 x 3 minors which we 
denote as a bracket of three vectors. If we start with the expansion using the last 
three rows we see that Cg = [abd]|ce f| — [abc]|de f]. <— 


Note that the LPG theorem can be rephrased so that it is also in terms of trees. 


THEOREM 4.65 ({184],[262]). A generic bar-and-joint framework associated to 
G(V, E) is minimally infinitesimally rigid in R? if and only if adding any edge e to 
G (which may give a multigraph) results in a graph that can be decomposed into 2 
edge-disjoint spanning trees. 


We will prove the more general statement for body-and-cad frameworks, and 
Theorem |4.63] will follow as a special case. The proof is essentially the same as the 
one given in with minor modifications for the more general setting. 


Body-and-Cad Frameworks. In the above discussion we defined body-and-bar 
frameworks which consist of a finite set of rigid bodies and distance constraints 
between points on pairs of bodies. We also saw that from the point of the com- 
binatorial and algebraic analysis we could work more generally with k-frames. In 
this section we will introduce [a, b|-frames, a generalization of the k-frame that is a 
combinatorial model for other geometric constraints that can be imposed between 
a pair of rigid bodies. 

Given a pair of full-dimensional rigid bodies in R@ we specify geometric con- 
straints between them by affixing affine linear spaces of dimension 0,...,d—1 to 
them and imposing coincidence, angle, or distance constraints between these linear 
spaces. For example, a “bar” in a body-and-bar framework is a distance constraint 
between two 0-dimensional linear spaces. 

We give examples of other cad constraints in the plane below. 
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EXAMPLE 4.66. In the plane a constraint requiring two lines to coincide removes 
two degrees of freedom. Here we affix a line to each polygon and require the lines 
(dashed in the figure) to coincide. If we fix the triangle, then the square can 
translate horizontally but cannot rotate or translate in any other direction. 


-- 6 eee eee -- << 


EXAMPLE 4.67. An angle constraint between two lines in the plane restricts 
only one degree of freedom. The full 2-dimensional space of translations remains 
unrestricted by the requirement that the two dashed lines stay perpendicular. 


1 << 


The authors Haller et al. laid out a rigidity theory for cad (coincidence, 
angle, and distance) constraints with nine constraints in dimension two and 27 in 
dimension three. As in the body-and-bar setting, it is possible to linearize and 
work in the infinitesimal setting. We refer the reader interested in the derivation 
of a combinatorial model and rigidity matrix for body-and-cad frameworks to the 
original paper or to [330]. The reader interested in the rigidity theory of 
other constraint systems may orient themselves by consulting and [219]. 

We ask the same question as in the two previous cases, does a given set of 
constraints specify a structure that is generically infinitesimally rigid? 

Before giving formal definitions it is important to note two key differences. 
First, a single geometric constraint may eliminate more than one degree of freedom 
as we saw in Example[4.66] Additionally, some geometric constraints only remove 
rotational degrees of freedom. For example, a system of constraints containing 
only line-line angle constraints cannot eliminate translational motions between the 
bodies. This is what happened in Example [4.67] 

This information can be summarized in a primitive cad graph. This is a multi- 
graph H = (V, RUB) with one vertex for each rigid body. Each geometric constraint 
gives rise to one edge for each degree of freedom constrained by it, and the edges are 
partitioned into two sets, R (red) and B (black), with the red edges corresponding 
to constraints that only restrict rotational degrees of freedom. In dimension two 
there is a single angle degree of freedom, and in dimension three there are three an- 
gle degrees of freedom. As a consequence, some constraints are generically nonzero 
in all k-coordinates while others are forced to have zeroes in some coordinates, even 
generically. 

The definition of a body-and-cad framework, together with all of the possible 
constraints is given in [185]. For our purposes, the algebraic and combinatorial 
analysis of the linearized constraints, it suffices to define the purely combinatorial 
object, an [a, b]-frame, which allows us to keep track of angle constraints separately 
from “blind” constraints which are generically nonzero in all coordinates. 
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DEFINITION 4.68. An [a, b]-frame G(x) is a bicolored multigraph G = (V, RUB) 
together with a vector (x-) of length k = a+ b for each edge, where the vectors 
labeling red edges have zeroes in the last b coordinates. 


If G(x) is an [a,b]-frame associated to a body-and-cad framework, then the 
underlying graph G is its primitive cad graph. Note that Definition follows 
which differs slightly from [254]. 

We will use the body-and-cad framework in Example [4.69] which is the case 
study in [144], to illustrate the theory throughout this section. 


EXAMPLE 4.69. Let G(x) be the [1,2]-frame associated to the body-and-cad 
framework consisting of a line-line coincidence constraint between bodies 1 and 2, 
a line-line coincidence constraint between bodies 1 and 3, and two point-point 
distance constraints between bodies 2 and 3 depicted below on the left, and the 
primitive cad graph G is on the right. 


By Example [4.66] the line-line coincidence for bodies 1 and 2 restricts two degrees 
of freedom, one of which is rotational. This gives two edges in the graph G, with 
one colored red. The same holds for bodies 1 and 3. <— 


DEFINITION 4.70. G = (V, RU B) is an |a, b]-graph if 3 B’ C B such that 


(1) (V, RU B’) is the disjoint union of a spanning trees 
(2) (V, B \ B’) is the disjoint union of b spanning trees 


In Example [4.69] we see that G is actually a [1,2]-graph as the red edges form 
a spanning tree, and the remaining edges decompose into two other spanning trees 
(thus B’ = —) in this case in Definition [Z.70). Note that a body-and-bar framework 
is a special case of a body-and-cad framework, so it is possible to eliminate all 
of the relative motions of a body-and-cad framework with point-point distance 
constraints (bars). Thus, there are generically rigid body-and cad frameworks with 
no red edges. However, as we will see in the proof of Theorem |4.73} if we do have 
red edges then they will affect how we decompose G' into spanning trees. 

Associated to an [a, b]-frame G(x) is a rigidity matrix Rg. As with the rigidity 
matrix of a k-frame, there are k columns per vertex. For each edge e from vertex i 
to vertex j with 7 < j, x. appears in the & columns associated to vertex i and —x, 
appears in the k columns associated to vertex 7. As before, we can eliminate the 
trivial motions of the system by tying down the 7th body. In the rigidity matrix 
this corresponds to appending k rows with a k x k identity matrix in the 7th group 
of k columns and zeroes everywhere else. 
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If the [a, b|-frame G(x) has kn — k edges, then tying down a body results in 
squaring off the rigidity matrix and allows us to define a pure condition for {a, b]- 
frames. 


DEFINITION 4.71. Given an [a, b]-frame G(x) with kn — k edges, its pure con- 
dition Cg is the determinant of Rg r. 


EXAMPLE 4.72. The tied-down rigidity matrix of the [1,2]-frame shown in 
Example [4.69] is 


a, ag a3 |—ay; —-ag —az 0 0 0 
b, O O | —by 0 0 0 0 0 
oc 8 0 0 0 O | -—cy 0 0 
d, dz d3| O 0 —-d, -—dz —ds 
(4.26) Rear = 0 0 0 ial €2 €3 —€] —=€9 ~-—€3 
0 0. OO) f& gf gs |=. “Je je 
0 O O 1 0 0 0 0 0 
0 O O 0 1 0 0 0 0 
0 O O 0 0) 1 0 0 0 


Red rows correspond to red edges in the graph, and being a [1, 2|-frame gives extra 
zeros in the red rows. The pure condition is the determinant, which simplifies to 


a, a2 a3 O 0 0 
b, O O 0 0 0 
Ci 0 0 =C] 0 0 
dy dy ds —d, —do —ds 
0 0 0 —E} =69 =€3) 
OO 0. =f gfe =fa 
= [abc][def| — [abd][ce f] 

= —[abd][cef] 


= bi (a2d3 — agd2)ci (eo fs — e3 fe). 


Interestingly, the two-term polynomial written in brackets is irreducible if the vec- 
tors are all generic 3-vectors. However, since b and c correspond to angle constraints, 
the first product of brackets vanishes. Moreover, the second product factors further 
because b and c correspond to angle constraints. << 


Co = det Rar = det 


THEOREM 4.73. The pure condition of a generic [a, b|-frame G(x) is nonzero 
if and only if it is an [a, b]-graph. 


We follow the proof of Theorem 2.18 in closely, modifying it to accom- 
modate the extra zeroes in the matrix in the rows corresponding to red edges. 
The direction showing Cg nonzero implies that G is an [a, b]-graph corrects what 
appears in [254]. The key to the proof is that the determinant of Rg,r can be 
expanded via n x n minors or /& x k minors in natural ways. 


PROOF. First we will show that if Cg(x) is nonzero, then the edges of G can 
be partitioned into k spanning trees as in Definition [4.70] Permute the columns of 
Rar so that they are arranged in k groups of n, with all of the first coordinates of 
the vectors x. in order, followed by all of the second coordinates, etc. 
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In the first a consecutive groupings of n columns we have a matrix whose first 
kn — k rows are nonzero scalar multiples of the rows of the incidence matrix for the 
graph G with edges directed from low vertices to higher ones. In the last b groupings 
of n columns the rows are nonzero scalar multiples of the incidence matrix for the 
graph G minus all of its red edges (since red rows in these groupings are all zero). 

For example, when we rearrange the columns of the matrix (426), the first six 
rows illustrate the structure just described: 


ay —ay, 0 ag —dag 0 a3 —a3 0 
by —by 0 0 0 0 0 0 0 
ce O =c,| 0 0 0 0 0 0 
d, O  —-d,|dz O -dzg|d3  O0 —dg3 
(4.27) 0 Ee, =E] 0 e€2 =€9 0 €3 —=€3 
O ff =f | 0. =—2|90 ff =< 
0 1 0 0 0 0 0 0 0 
0 0 0 0 1 0 0 0 0 
0 0 0 0 0 0 0 1 0 


If we expand this determinant via blocks of n x n minors, the determinant is a 
sum of products II*_, det M; where each M; is a subset of n rows in the ith block 
of columns chosen so that the union of the M; partition the rows of Re. 

Since Cg is nonzero, at least one of these products must be nonzero. For such a 
product, each det M; is nonzero. This means that the rows of MM; must not contain 
a cycle, and since we have chosen n rows, they must correspond to the n — 1 edges 
of a tree and a tie-down row. 

If 1 <i <a, then the rows in M; correspond to spanning trees that may contain 
red edges (together with some subset of black edges). The last b of the M; cannot 
contain rows corresponding to red edges because these rows are entirely zero, as 
in (£27). Therefore, we have found a decomposition of G into k spanning trees, a 
of which may contain red edges, and b that contain only black edges. 

For the reverse direction we need to prove that if G is an [a,b|-graph, then 
the determinant of Rg,r is nonzero. The plan is to produce a specialization of 
the generic edge labels at which Cg is nonzero, showing that it must be a nonzero 
polynomial. 

Since G is an [a, b]-graph we have a trees that may contain red edges and b all 
black trees. Order them in such a way that the trees with red edges come first. 

In each tree there is a unique path from any vertex v to the tie-down. Orient 
the edges of G so that all of these paths are directed towards the tie-down. Each 
of the remaining n — 1 vertices has exactly k outgoing edges, one in each tree. The 
determinant of Rg 7 can be expanded as a sum of products of k x k minors in a 
natural way. As shown in the proof of Theorem 2.18 in [361], one of these, P, is 
the product of & x k minors consisting exactly of the rows corresponding to the k 
outgoing edges from each vertex, one from each tree 

Now we give the edge labeling. Give each edge in tree i the same label, 
(%i1,--+-,i,n), Where the last 6 coordinates are zero for labels of red edges. If 
we evaluate P at this specialization of the edge labels, we see that we get a prod- 
uct of nonzero determinants. However, all of the other products of minors in the 
expansion of Cg must be zero. Indeed, to form any k x k minor we must choose 
k; edges incident to each vertex. Any other choice requires that for some vertex we 
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choose two edges from the same tree, and since these edges have the same labeling 
in our specialization, that minor must be zero. 


As a consequence of Theorem [4.73] we get a combinatorial characterization of 
generic minimal infinitesimal rigidity for body-and-cad frameworks. 


THEOREM 4.74 (Lee-St.John—-S [254]). A body-and-cad framework is generi- 
cally minimally infinitesimally rigid if and only if it is a 
(1) [1,2]-graph in 2D. 
(2) [8,3]-graph in 8D (point-point coincidences are omitted). 


Interestingly, we must omit point-point coincidences from our constraint sys- 
tems because the corresponding equations have symmetries that prevent us from 
incorporating them into the general theory above. 


Nongeneric Frameworks. As we saw earlier, a k-frame G(x) is generically 
minimally infinitesimally rigid if and only if Ce¢(x) is a nonzero polynomial. If 
p ¢ V(Ca), then G(p) is minimally infinitesimally rigid. Hence, almost all realiza- 
tions of G have the generic behavior. Given this, it is natural to ask why we should 
study nongeneric frameworks. 

The importance of nongeneric frameworks depends on the application. If the 
object of interest is truly a random realization of a given generic k-frame, then 
the combinatorial analysis suffices. However, frameworks designed to be useful are 
not random realizations of their combinatorics. They tend to have symmetries, 
repeated geometric patterns, and nongeneric parameters. Additionally, studying 
the nongeneric configurations of a framework allows us to deepen and complete our 
understanding of a framework as pure mathematicians. 

We take the point of view of White and Whiteley in and study nongeneric 
infinitesimal behavior by studying the pure condition Cg. This line of inquiry 
has two main branches. One is to use factors of Cq@ to understand important 
substructures of G. The other is to interpret the vanishing of a factor with an 
intuitive geometric description of the corresponding special positions. 

To provide some intuition we return to Example [4.58] 


EXAMPLE 4.75. The corresponding 3-frame G(x) = G(a, b,c) is depicted below. 


a 


@) 


c 


The pure condition Cg = [abc] vanishes whenever the points a, b, and c are collinear 
in the projective plane. But these points are the Plticker coordinates of the lines 
along the three bars, so in the dual projective space this corresponds to the condition 
that the three lines meet at a point. Here, we see an advantage of working in 
projective space. The two conditions that we identified initially, either the three 
lines are parallel or coincident, can be described by a single geometric condition in 
projective space because the case in which the three lines are parallel is equivalent 
to their common point of intersection lying on the line at infinity. << 


We see that the vanishing of the pure condition in Example [4.75] has a nice 
geometric interpretation—the lines along the three bars are coincident in projective 
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space. This interpretation is easy to decipher because the pure condition is the 
vanishing of a single bracket, and we already knew a geometric interpretation for 
the vanishing of a determinant. 

Interestingly, it is sometimes possible to give similar interpretations, in terms 
of linear spaces meeting unexpectedly, using the Grassmann-Cayley algebra. Given 
a vector space W, let \/ W be the usual exterior algebra on V, though we denote 
the exterior product with a V instead of the usual A. If wy,...,w, € W, then 
w1V-+--V ws is an eztensor, and a typical element of \/ W is a linear combination 
of extensors. Since an extensor is nonzero if and only if w,,...,ws form a basis for 
their span, it is natural to identify an extensor with a subspace of W and the product 
of two extensors that represent subspaces with trivial intersection represents the 
join of the subspaces. 

The reason that we write the exterior product using V instead of A is that the 
Grassmann-Cayley algebra is formed by introducing a second operation on \/ W 
called the meet. The meet of two extensors u and v is a scalar times the extensor 
representing the intersection of the subspaces represented by u and v if this subspace 
has the expected dimension. If the intersection of two subspaces does not have the 
expected dimension, then their meet is zero. 

To give a more precise definition we must first review the notion of a shuffle. 
A shuffle of [s] into sets of size k and s — k is a permutation o € S, satisfying the 
following: a(1) < 0(2) <--: < a(k) and o(k +1) < o(k +2) <--+ < a(s). We let 
sign(o) denote the sign of o as a permutation. 


DEFINITION 4.76. Let W be a vector space of dimension r and u = u, V---V us 
and v = v1 V---V vu be extensors with s +t >r. Then we define 


uAv= S- sign(7)[wo(1) “t+ Ua(r—t)U1*** Ut]Uo(r—t-+1) Voree V Ua(s)> 


where the sum is over all shuffles o that split [s] into sets of size r—t and s—(r-—t). 


For example, suppose that a, b, c,d, e, f are vectors in R?. Then (cVd)A(aVb) = 
[cab]d — [dab|c. Moreover, 


((cV d) A (aV b)) A (eV f) = [cabl[def] — [dab] [ce f] = [abc][def] — [abd] [ce f}. 


The reader should note that [abc][de f] —[abd][ce f] is the pure condition of the body- 
and-bar framework in Example[4.64] If a polynomial in brackets can be written as a 
single term in the Grassmann-Cayley algebra using only meets and joins (without 
summing), then we say that it has a Grassmann-Cayley factorization. Not all 
bracket polynomials have such factorizations, and even when such a factorization 
exists, it may be computationally intensive to produce it. (See for details.) 
However, such a factorization does invite a geometric interpretation of the vanishing 
of the bracket polynomial. 

In the case of Example [4.64] we consider a, b, c,d, e, f to be points in P? and we 
get the condition that the lines ab, cd, and ef meet in a point. Since these vectors 
are the edge labels of a body-and-bar framework, they are the Pliicker coordinates 
of the lines lying along the bars in the framework. To get an interpretation in 
the original projective space we dualize, and ab represents a point where the lines 
represented by a and 6b intersect, etc. Therefore, the condition is that the points 
ab, cd, and ef are collinear. The analysis of the framework in Example [4.64] given 
above appeared in [361], and illustrates one line of analysis of frameworks using the 
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pure condition in which special geometry configurations causing the pure condition 
to vanish can be identified via a Grassmann-Cayley factorization. 

Another line of inquiry involves using irreducible factors of the pure condition to 
identify rigid substructures of a framework. In the case of body-and-bar frameworks 
White and Whiteley lay out a beautiful correspondence between properties 
of the pure condition Cg and properties of the underlying graph G. For example, 

(1) Each edge of G is in the support of exactly on irreducible factor of Co. 
(2) Each factor of Cg is the pure condition of a graph obtained by the deletion 
or contraction of edges of G along with the deletion of any isolated vertices. 

For a general [{a, b]-graph G, it is still the case that each edge is in the support 
of exactly one irreducible factor of Cg. However, irreducible factors of Cg may 
not be the pure condition of any [a, a], [b, b], or [a + b,a + b]-subgraph of G. The 
circuits of body-and-bar frameworks all have the form of a minimally rigid graph 
plus an extra edge, whereas the structure of circuits of [a, b]-graphs is mysterious, 
motivating the problem below. 


Problem: Describe and classify circuits of [a, b|-graphs. 


Before their paper on body-and-bar frameworks, White and Whiteley intro- 
duced a notion of a pure condition for bar-and-joint frameworks in [362]. The pure 
condition for body-and-bar frameworks is presented first here because the corre- 
spondence between the polynomial and the combinatorics of the graph fits together 
so nicely—it really is a model for such a construction. Now we briefly sketch the 
theory for bar-and-joint frameworks, highlighting the differences between the two, 
before ending with a discussion of various ways of using the Cayley-Menger variety 
to study nongeneric realizations of frameworks. 

Let G(x) be a generically minimally infinitesimally rigid bar-and-joint frame- 
work. The pure condition, Cg, is defined as before by adding eae rows to the 
rigidity matrix to eliminate trivial motions of the framework as a rigid unit. How- 
ever, in the case of bar-and-joint frameworks we work in the coordinates of the 
joints, and a single joint only has d coordinates. The interested reader can consult 
White and Whiteley [362], where they describe a general way of fixing joint coor- 
dinates to tie down the framework in a way that works well with the computation 
of the pure condition. 

Instead of giving the general construction here we provide a detailed example. 


EXAMPLE 4.77. Consider the generic framework G(x) associated to the graph 
on the left. We tie down the framework at vertices one and two, where the coordi- 
nates of the first joint will be completely determined by a and b. 

1 


c 2 
5 5 


The rigidity matrix associated to the tie-down is given below. The last three 
rows correspond to the tie-down. There are ten columns total as each column shown 
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represents a two-dimensional vector so is really two columns. 


X,—XQ XKg—-X, 0 0 0 

xX XB 0 >, ee, Sil 0 0 

xX, —X4 0 0 X4— Xj} 0 

0 X2—-X3 XK3— X92 0) 0 
R _ 0 XK2q — X5 0) 0 X5 — XQ 

i 0 0 X3—X4 X4—X3 0 
0 0 0 X4—X5 X5 — X4 

Xj-—a 0 0 0 0 

x1—b 0 0 0 0 

0 X2—C 0 0 0 


We expand det Rg,r via 2 x 2 minors, i.e., we multiply the determinants of two 
rows from the first block of two columns, two rows from the second two columns, 
etc. Note that if we view the joint coordinates as elements of projective space by 
sending (@11,%12) + (1,211,212), we can write a 2 x 2 minor as a 3 x 3 determinant 


in a nice way 
1 ay ag 


det & ~ ; =det|1 0; be 
x; —b 
1 21 Ve 


We will abuse notation and write the determinant above as the bracket [a, b, x]. 
Hence, det Re,r = [a, b, x1][X1, ¢, X2][X1, X2, X3][X1, X3, X4][X2, X4,X5]. White and 
Whiteley show that with such a choice of tie down, the factors involving the tie- 
down variables are irrelevant, and we can define the pure condition of G to be 
Cg => [x1, X2, x3] [x1, X3, x4] [xo, X4, X5]- 

Here, the vanishing of Cg has a natural geometric interpretation. Each factor 
is a 3 x 3 minor which vanishes when the corresponding three points are collinear. 
So, the framework has infinitesimal motions when any one of the three triangles, 
{1, 2,3}, {1,3,4}, or {2,4,5} degenerates. << 


At a point p such that C¢(p) = 0, there must be a linear dependence on the 
rows of the rigidity matrix Rg(p). A nonzero dependence relation on the rows of 
Re is called an equilibrium stress, and the support of the stress is the set of edges 
corresponding to rows with nonzero coefficient in the stress. The stress coefficients 
have a physical interpretation, so it is important to know the subgraph where a 
stress is supported. 

Interestingly, the support of a stress associated to a given irreducible factor of 
Ce need not be related in obvious way to the structure of the factor as the example 
below shows. 


EXAMPLE 4.78. Continuing along from Example [4.77] note that in this case 
Ca factors as the product of three brackets, each of which vanishes when a triangle 
of three points degenerates to a line. Two of the triangles, on vertices {1, 2,3} and 
{1,3,4} are subgraphs of G, and each of these supports a stress. However, the 
rigidity matrix for the induced subgraph on vertices {2, 4,5} is 


0 X2 — X5 0 0 X5 — XQ 
0 0 QO X4—X5 X5—X4 


? 


and we can see by inspection that these rows cannot support a stress alone. <> 
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As Example [4.78] shows, irreducible factors of the pure condition can have 
the same algebraic structure while the combinatorics of the associated stresses is 
very different. To understand where the stresses associated to the vanishing of an 
irreducible factor of the pure condition are supported, we will a. to the Cayley- 
Menger variety, which first made its appearance in Section [4.1] The advantage of 
working with the Cayley-Menger variety is that its defining equations are given in 
terms of the (squared) distances, which correspond to edges in the graph of the 
framework. Hence, the support of a polynomial naturally corresponds to a unique 
subgraph. 

Recall that if we have n points in R? then we can define y : (R*)” > R@) 
by pij(x) = (xi — x;)- (ki — x;) = 4, whose output is a vector of (squared) 
distances between pairs of points. The map y was denoted @x, q in Section 
Since we will use algebraic techniques, we regard y as a map over C, and we let 
yp* : C[é,;] + C[x] be the map on rings corresponding to the length map. Given a 
graph G on n vertices, we can follow this map by the projection 7g : c) + C* 
given by the edges FE of G. Function composition gives us the triangle of maps: 


p 
(C4)") ———> *CMnac CCU) 


\ P 


Here, ker (y*) is the ideal of the Cayley-Menger variety “CM y,,a,c and yq is the 
map cq from Section [4.1] A version of this diagram appeared as (4.14). 

Each element of the ideal of the Cayley-Menger variety gives rise to a stress 
via the chain rule. 


LEMMA 4.79. If f € ker(y*), then Vf is a stress whose support is contained 
in the support of f. 


Proor. By definition of ker (y*), f(y(x)) is identically zero. Hence, 
A f(y(x))] = VF(e(x)) - de = 0. 


Up to a constant, dy is the rigidity matrix for the complete graph on n vertices 
(see (4.7)). Hence the equation above shows that Vf is a stress. It is clear that the 
support of Vf is contained in the support of f. 


Via Lemma [4.79] we can see how to find a stress associated to each factor of 
the pure condition. 


THEOREM 4.80 (Rosen—S—Theran-Vinzant). Let G = (V,E) be a minimally 
rigid graph in dimension d. If g is an irreducible factor of Cg, then 
(1) P= (y*)~*((g)) is prime. 
(3) Vf is an equilibrium stress for frameworks in V(qg). 


EXAMPLE 4.81. Continuing on with Example [4.77] using MACAULAY2 we can 
see that the defining equation of yg(V([x1, X2,X3])) is given by 


024 + Wiolis — O25 + Wioles + 2lizlos — Bs, 
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which (up to sign) is the Cayley-Menger determinant from Example [4.26] 


0 1 1 1 
1 0 fe &s 


det. 


This is the Cayley condition that a triple of real numbers must satisfy if they 
are the pairwise distances among 3 collinear points. The defining equation of 
ya(V ([x1, X3,X4])) is analogous. On the other hand, the defining equation of 
pae(V ([X2,X4,Xs5])) is an irreducible polynomial of degree 6 supported on every 
edge of G. <— 


Motions of Frameworks. In the above discussion we saw how polynomials in 
the ideal of the Cayley-Menger variety give stresses on a framework. This is part 
of the infinitesimal analysis of rigidity. It is natural to ask if there is a way to use 
any of the algebra that we have set up to detect frameworks in positions where 
they actually move. This is the focus of ongoing work between the author and 
collaborators Zvi Rosen, Louis Theran, and Cynthia Vinzant. 

We leave the reader with an example of a procedure for detecting motions: 


EXAMPLE 4.82 (Rosen—S—Theran-Vinzant). Let G = (V,F) be the graph in 
the plane given below. The idea is that since G is generically minimally rigid, E is 
a maximal independent set in the algebraic matroid associated to ker (y*), which 
is the rigidity matroid R2(5). If we add in a non-edge e, then G+ e must contain 
a circuit. We can view the associated circuit polynomial f, as a polynomial in the 
variable e. For a general choice of the other edge lengths, we get finitely many 
possibilities for e. Letting I be the ideal of the coefficients of f., we see that the 
variety defined by J consists of edge lengths for which there are no conditions on e, 
so there must be infinitely many possibilities for it. Hence, a framework with these 
edge lengths has realizations that have motions. 


e Add in edge e = 35. 

e G+e contains a circuit. 

Get circuit polynomial p of degree 
4 in each variable. 

I = (coefficients of p(e)). 

e I has 6 associated primes. 


The two that are geometrically significant for motions are 
(€13, £14 — €34, £12 — £23), (l23 — £34, f12 — £14, —L45 + £25). 


What we see is that if vertices 1 and 3 coincide, ¢)4 = ¢34, and 12 = ¢23 then a 
motion is possible. A realization of such a graph looks like a 4-bar mechanism. 
The reader may check that the other ideal allows the graph to be realized with 
vertices 2 and 4 coinciding. This allows the doubled edge from vertices 2 and 4 to 
vertex 5 to rotate while the triangle is held fixed. <- 
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Conclusion. Rigidity theory is an area that is ripe for modern algebraic methods. 
The infinitesimal combinatorial theory developed as a way to avoid working with 
the nonlinear (typically quadratic) polynomials that encode constraints. However, 
as we have seen, polynomials re-emerge in the combinatorial setting through the 
White-Whiteley pure condition. Moreover, working with polynomials in the length 
space and the Cayley-Menger variety presents new viewpoints and connections. 


CHAPTER 5 


Chemical Reaction Neworks 


A network of chemical or biochemical reactions gives a system of differential 
equations by the Law of Mass Action. These systems have a lovely structure whose 
analysis involves graph theory and toric varieties. 


5.1. The Classical Theory of Chemical Reactions 


Our first goal is to make the transition from chemistry to mathematics. The 
starting point is the Law of Mass Action. 


Law of Mass Action. Let’s jump right in with a simple example. 


EXAMPLE 5.1. In the atmosphere, the most common form of oxygen is the 
molecule O, consisting of two oxygen atoms. Nitric oxide NO and Oy, react to 
produce nitrogen dioxide NO,. We write this reaction as 


(5a) 2NO +0, —> 2NO, 


Let [NO], [05], [NO] denote concentrations, which are functions of time t. Naively, 
the concentration is the number of molecules per unit volume, though chemists use 
more natural units called molarity. 

The Law of Mass Action describes the rate of change of concentration in a 
reaction. Applied to the above reaction, we get 


d 
(5.2) qNOel = 2« [NO]? [09], 
We explain this as follows: 


e The number of ways two NO and one Oy can collide is roughly [NO]?[O,]. 

e The rate constant « corresponds to how many potential collisions actually 
occur and how fast the reaction is. 

e The 2 on the right-hand side of the reaction (5.1) tells us that we preserve 
the number of atoms of N and O. 

e This gives the 2 on the right-hand side of the differential equation (5.2). 


There is a differential equation for each concentration, giving the system of ODEs: 


[NO] —2«[NO]? [05] 2 0), 2 
(5.3) “ 2 = ~K[NO}*[0,} = ' ° | or | 
[NO}] 2% [NO}"[O3] ; — 


The product Y A,@® hints at the structure that underlies the reaction (5.1). < 


A Small Lie. The chemistry of the reaction (5.1) is more complicated than 
first meets the eye. Our explanation of (5.2) required collisions between two NO 
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molecules and one Oy, molecule. But it is eatremely rare for three molecules to 
collide simultaneously. 

In reality, 2NO +O, + 2NO, is the result of elementary reactions that 
involve at most two molecules. A proposed mechanism for 2NO +O, > 2NO, 
uses two elementary reactions: 

NO+NO ==N,0, (fast) 
(5.4) i 
N,0, + 0, 24 2NO, (slow). 


The first reaction reaches a steady state very quickly, so that 
2 

(5.5) Kk [NO]° = k[N,09]. 

The second slower reaction then gives the differential equation 


£(NO,] = 2K3[N,0,][O,] = 2«[NO]?[0.], where « = K3 a 
However, these reactions are so fast that chemists cannot be certain that this is 
what actually happens, which is why we say proposed mechanism. These and other 
subtleties are discussed in the 2012 article by Gunawardena. 

For us, the upshot is that when we apply the Law of Mass Action to a reaction 
like 2NO + O, —*+ 2NOg, the resulting product [NO]?[O,] is a convenient fiction 
that hides a more complicated reality. 


Complexes, Species, and Directed Graphs. We now begin our study of the 
underlying structure of chemical reaction networks. 

The molecules occurring in a reaction are called species, and combinations of 
species are called complexes. The reaction of Example [5.1] has two complexes and 
three species: 


two complexes 


‘A \ 
(5.6) 2NO+O0,| —> |2NO, 


three species 


This reaction also gives some mathematical objects: 


e For each species, its concentration is represented by a variable: 
z, = [NO], tz = [O,], 23 = [NO]. 


e For each complex, we have a nonnegative linear combination of the species. 
This gives coefficient vectors: 


Y= (2,.1,.0), y= (0,0, 2) 


that tell us how each complex is built from the species. However, y; and yz 
are also the exponent vectors of the monomials: 


[NO]?[0q] = afjrg = xO) = a, [NOQ)? = 23 = 2?) = ge, 
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Then the system of differential equations (5.3) can be written more compactly as 


Y2 Y1 
—~ ms 
d XY 0 2 
(5.7) aH |72| = KH ( 0} —]1 ) 
X3 2 2 0 


gl 


Notice also that the two complexes in form the vertices of the directed graph 
e —> e underlying the reaction. 

We will soon see how all of this follows from the general form of the Law of 
Mass Action. But first, let’s explore another example. 


EXAMPLE 5.2. T-cell signal transduction gives the chemical reaction network 


complex 1 


A+B A+B 
K21 
(5.8) Cc £235 9 xy ng 
“8h, hA'B Lh 
complex 3 complex 2 


We have 4 species, 3 complexes, with 4 reactions since A+B — C is reversible. 
21 
The directed graph has vertices given by the complexes and directed edges given 
by the reactions. 
To reveal the structure of (6.8), we introduce variables and exponent vectors 
ele he eau t3 = [C], ta = [D] 
y1 = (1,1,0,0), yo = (0,0,1,0), ys = (0,0,0, 1). 


Then the Law of Mass Action gives the system of differential equations 


day YA y2 YB 
a = —K12%12%Q + K1%3 + K31L4 = —K1 22 Kate" + W310 
dara YA y2 YB 
sae. = —K12%1%Q + K1%3 + K31L%4 = —Ky1 2x Kai 0e" + K310 
(5.9) 
dx —_ o = Y1 5 Yy2 
HE. = K12%1%2 — K21%3 — KQ3%3 = KY QL™ — (Ka1 7 Ko3) 
dx4 F 
= = K93%3 — K31%4 = Ko3x"? — Kg, 0%, 
dt 
which can be written 
Ly —1 1 0 a 
d |x —1 1 0 1 
2 : 
5.10 — = Kyu" +k 0"? +hko30"? +4310 
( ) dt |x3 12 1 21 -] 23 = 31 0 
LA 0 0 0 —1 
Y2-Y1 Y1—Y2 ¥3—Y2 Y1—Y3 


What seemed like random + and — signs in (6.9) now make perfect sense in (5.10) 
once we realize that the column vectors on the right are differences y; — y; for 
suitable 7 and 7. < 
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Chemical Reaction Networks. A general chemical reaction network consists of 
the following data: 


e nm species and m complexes. 
e a directed graph G = (V, E) with positive weights K = (Ki-,;): 
— The vertices are the m complexes, i.e., V = [m]. 
— The directed edges are the reactions. 
— The weight of the directed edge i—] is the reaction rate Kj, ;. 
e An n xX m matrix Y whose columns yj,.-..,Y%m express the complexes as 
nonnegative integer linear combinations of the species. 
Since a chemical reaction network is determined by its reactions, we assume that 
G has no isolated vertices. For simplicity, we often write «;; instead of Ky,;. 
Given a chemical reaction network (G,«, Y), let x; be the concentration of the 
ith species. Each x; is a function of time, and the Law of Mass Action tells us that 


the vector x = (41,...,%n) satisfies the system of ordinary differential equations 
dx ; 
(5.11) a yee), eS eel 
tojek 


For example, the reaction (6.1) gives the system (5.7), and in Example [5.2] the 
reaction network (5.8) gives (6.10). Both of these are instances of the Law of Mass 
Action. In practice, the system (5.11) is often referred to as mass-action kinetics. 


The Graph Laplacian. Equation (5.11) is not the only way to write the Law 
of Mass Action. Adapting (6.3) to our current notation, the systems of ODEs in 


Example becomes 
dx _ ; : —K O} Jat 
dt kK Of} ja¥?]* 
Ne, re Nee ree” 


O(x) 


We now know what Y means, and ®(z) is the column vector of monomials «. But 
what is the matrix A, in the middle? This is where the graph Laplacian enters the 
picture. 

For a simple graph G = ([m], £), the graph Laplacian is the m x m symmetric 
matrix L = (Li;), where 


deg(i) i= j 
Ly={-1 #€E 
0 otherwise. 


This matrix is symmetric with rows and columns that sum to zero. 
EXAMPLE 5.3. Here is a graph from Chapter 4 and its graph Laplacian 
ss 3 3 -1 -1 -l 
-1 2 -1 0 
-1 -1l 3 -l 
1 2 i 2k 2 > 


This idea adapts easily to weighted graphs, where the off-diagonal entries are 
now —&,; and the diagonal entries are determined by the condition that the row 
and column sums are zero. 
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In our situation, we have a weighted directed graph (G,«), which introduces a 
measure of asymmetry. Here, the weighted directed graph Laplacian is the m x m 
matrix D,, defined by 


Do Risk G55 
5.12 LJjgse ee ee 
( ) (Lr)is Kyi t#j, jriek 
0 i#j, j>i€ EB, 
where the diagonal entries ensure that the column sums are zero. Here, we follow 
123], where L,, is denoted Lz. 

To get the matrix A, we want, we take the negative of L,, ic., Ay = —Lx. 
Note that the column sums of A, are zero. With this definition of A,, the Law of 
Mass Action (5.11) can be written more compactly as 

dx 
(5.13) — =YA, (2), 

dt 
where Y is the n x m matrix whose columns yj,...,Y%m express the m complexes in 
terms of the n species, A,, is as above, and ®(x) is the column vector of monomials 
v¥1,...,2¥™ in the species concentrations 71,...,%n- 


EXAMPLE 5.4. Consider the chemical reaction network of Example 5.2} 


complex 1 


A+B 


ow OS® 


D+—_.— C 
complex 3 complex 2 


We can write the system (5.9) in the form 


sa eo —K K K 
a v2 => : 0 K 7 —K a i. Pe 
di |x3|~ {0 1 0 os . 23 ay - 
LA 0 0 1 23 31 4 
Yy An B(x) 


<< 
Balancing Equations. There is another way to see how A,, arises naturally from 
G = ([m], E,«). Suppose we have a quantity C; at each vertex i. If we allow the C; 


to flow along the directed edges with rates given by K;_,;, then 


inflow at vertex 7 = y Kya Cy 


jrick 
outflow at vertex 7 = y Kya j7 Oi = ( y +4) Ce 
i>jeEB i3je€E 


The system is balanced when inflow = outflow at every vertex 2, i.e., when 


YS HG - (YD wis) =0 


jrick I> kee 


184 5. CHEMICAL REACTION NEWORKS 


for all 7. In terms of the matrix A,,, this system can be written 
Cy 
(5.14) A, |: | =0. 
Cm 
These balancing equations will play an important role in what follows. 
Stoichiometry. The Greek words ototyeto (stoichefo) and uéteo (métro) translate 


to element and measure respectively. Chemical reactions neither create nor destroy 
atoms; stoichiometry is the quantitative consequence of this basic principle. 


EXAMPLE 5.5. The reaction 2NO + O, + 2NO, has conserved quantities: 


(Nitrogen atoms) [NO] + [NO.] = [NO]p + [NO] 
(Oxygen atoms) [NO] + 2 [O,] + 2[NO.] = [NO] + 2 [Og]p + 2 [NOd]q 
The subscript 0 denotes initial concentrations. After some algebra, one obtains 
[NO] [NO]o —2 
[02] | = | [Oalo | +A) |-1] , AG) = —H([NO] — [NO]o). 
[NO,] [NO3]o 2 


For this reaction, stoichiometry implies that the reaction evolves along a line. <> 


Looking deeper into this example reveals the structure more clearly. When we 
write the Law of Mass Action in the form (5.11) for this reaction, we get 


de a2 9 
He TE ye — 1), y2— yi = |O-1] = |-1 
T=) 2 


d. 
The tangent vector ; always lies in the span of y2 — y1, so that 


(5.15) x(t) € (0) + Span(y2 — yr). 
A key observation is that the stoichiometry is determined by y2—y;. More precisely: 


e By (6.15), any vector v satisfying v - (y2 — y1) = 0 has the property that 
v- a(t) =v-x(0) for all t. Hence v- x is a conserved quantity. 

e One computes that {(1,0,1),(1,2,2)} is a basis of (ys — y,)+. This gives 
another way to see that [NO]+[NO.] and [NO]+2 [O,]+2 [NO] are conserved. 


Thus everything in Example [5.5] is a consequence of the vector yo — yj. 


The Stoichiometric Subspace. For a general mass-action kinetics system 
“ = x Ki+j 0% (yy — yi), 2 = (€1,---,En)*. 
i>jeE 
as in (5.11), the stoichiometric subspace is 
S = Span(y; — yi | iy € £). 
If x = x(t) is a solution of the system, then arguing as above shows that 
x(t) € x(0)+S. 


It follows that if we have a vector v € S+, then v- x is a conserved quantity and 
equals the constant v- x(0). 


5.1. THE CLASSICAL THEORY OF CHEMICAL REACTIONS 185 
However, in chemical reactions, concentrations are nonnegative. Thus x(t) € 
RSo, which implies that 
(5.16) z(t) € (2(0) + S)NR§po for t > 0. 


We call (x(0) + S) A R§q a stoichiometric compatibility class. Here is a picture: 


_ 


In general, stoichiometric compatibility classes are polyhedra. The above example 
is bounded, i.e., a polytope. In Section[5.2] we will see an example of an unbounded 
stoichiometric compatibility class. 

Above, we justified by saying that in the real world, concentrations 
cannot be negative. This can also be proved mathematically, i.e., 


a(0) € RSo => a(t) € (2(0) + S) ARQ for t > 0. 
A proof can be found in [129]. 


Steady State Solutions. An important idea in chemistry is that most chemical 
reactions achieve an equilibrium as time goes on. Let’s write the corresponding 


system in the form 


dx 
— =YA,O(z). 
a « P(x) 
A steady state or equilibrium solution is a solution where the concentration vector 
v= (x1,...,%n)* is constant. Thus x is a steady state solution if and only if 
YA,®(x) = 0. 


This system of polynomial equations shows that the steady state solutions lie in 
the steady state variety 


(5.17) Vv = V(YA,®(x)) CR”. 
The physically meaningful steady state solutions correspond to 
(5.18) So = V(Y A, ®(z)) A RSo. 


Observe that V** is a real algebraic variety and V$p is a semialgebraic variety. Here 
are two simple examples. . 


EXAMPLE 5.6. For 2NO+0, —*+ 2NOn, the steady states of the system &.3) 
satisfy [NO]?[O,] = 0, so 
[NO] = 0 or [0,] =0. 


In other words, we run out of one of the reactants. This is extinction. <P 
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EXAMPLE 5.7. In @.4), we decomposed Example[5.6jinto elementary reactions, 
one of which is NO + NO —2s N,O,. The resulting system is 
21 


4 [NO] = —2K12[NO]? + 2K21[N,05] 
7 N20o] = K12[NO}? — K21[N,09]. 
The steady states satisfy the equation 
K12[NO]? = ka1[N,05] 
that appeared earlier in (5.5). < 


Three Examples. So far, we have studied networks of chemical reactions. But 
there is a wide range of phenomena having nothing to do with chemistry that can 
modeled using “chemical reaction networks”. Here we present three examples, one 
of which we’ve already seen. 


EXAMPLE 5.8. Recall the HIV model from considered in Example 


Description Interaction Parameter value 
Generation of new T cells 9—>T 10 
Generation of new macrophages 0+ M 1.5 x 1071 
Proliferation of T cells by presence of pathogen T+ V Jes (T+V)4+T 2x 10-3 
Infection of T cells by HIV T+V 2, T; 3x 1073 
Proliferation of M by presence of pathogen M+V ay (M+V)+M_ 7.45 x 10-4 
Infection of M by HIV M+V 4, M; 5.22 x 10-4 
Proliferation of HIV within T cell qT; +s, V+T; 5.37 x 1071 
Proliferation of HIV within macrophage M; eo, V+M; 2.85 x 1071 
Natural death of T cells gals ) 0.01 
Natural death of infected T cells Le 2, 0 0.44 
Natural death of macrophages M BGs ) 6.6 x 1073 
Natural death of infected macrophages Mi ote 0 6.6 x 107-3 
Natural death of HIV v 2559 ? 


The interactions between cells shown here clearly form a chemical reaction network. 
One difference is that cells can be created and can die. For example, the “reactions” 


@ > T and T os —) represent the creation and death of T cells. 
We think of these interactions as a biochemical network, where the species are 
cells rather than atoms. Applying the Law of Mass Action, this network gives 


£T = 5, +k TV — koTV — 5T 
£T; = keTV — oT; 

(5.19) 4M = s8.+k3MV — kyMV — 63M 
4M; = kaMV — 64M; 


£V = ksT; + ko Mi — 55V, 


I 


exactly as in (2.23). It is nice to see how this biological example from fits 
into the framework of chemical reaction networks. <— 
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EXAMPLE 5.9. In epidemiology, the SIR model is given by 


S = number of susceptible individuals = = —asl 
I = number of infected individuals ~ = aSI— BI 
R = number of recovered individuals ee = BI 


Many courses in differential equations present this as a compartment model: 


S| — JI] — |R). 


Sometimes there are labels on the arrows: 


aSI BI al B 
S| — |I] —> |R]|_ or S| —> {I} —> |R}. 
labels are rates of change labels are relative rates of change 


We can also think of this as a reaction network: 
Shires 


The Law of Mass Action gives the above differential equations. 

The difference between these two presentations of the SIR model is easy to 
understand: the reaction network is complex based, while the compartment model 
is species based. <— 


EXAMPLE 5.10. In population genetics, chromosomes have locations where dif- 
ferent alleles can occur. Examples of alleles include blood type or the color of a 
flower. Assume that we have two alleles A, B with frequencies p, q. In this situation, 
the genotypes AA, AB = BA, BB have what are called Malthusian fitnesses a, b,c. 
Roughly speaking, these are the logs of the average numbers of offspring. 

Going from one generation to the next gives a discrete model. Passing to a 
continuous model, the resulting differential equations are 

marginal mean fitness of 


fitness of A the population 


d / : 2 2 

aP = p(ap + bq) — p(ap* + 2bpg + cq’) 

£4 = q(bp + cq) — q(ap* + 2bpq + cq’). 
In 1979, Hars and Toth characterized which systems of ODEs come from 
chemical reaction networks. The above differential equations satisfy their criteria, 


and using the method described in [189], it is straightforward to show that these 
ODEs come from the network 


2A A A+B A 2B 
aja Za SS Ele 
34 2A+B A+2B 3B 
via the Law of Mass Action. <— 


Questions About Steady States.. Given a system he = YA,®(z), there are 
several questions to ask about a nonnegative steady state solution 7, € VS5. 
The stoichiometric compatibility class of x, is (x, +5)R&,, where S C R” is 


the stoichiometric subspace. The first question is whether (7, +.S)N R&p contains 
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other steady states. In other words, how big is V8. (a. +S)? The answer can 
depend on which stoichiometric compatibility class we are in, as illustrated here: 


ss 
VSG 


A network exhibits multistationarity when some stoichiometric compatibility classes 
contain more than one steady state. Monostationarity occurs where there is at most 
one steady state in each stoichiometric compatibility class. 

Since solutions of ee = YA,®(x) always stay within a given compatibility 
class, there are two further questions to ask about a steady solution x, € VSo: 

e Is x, locally attracting, ie., if (0) € (v +S) 1 R§p is close to x, does 

x(t) > a, as t > 00? 
e Tf VS5N(t. +S) = {zx}, is x, globally attracting, ie., if x(0) € (t.+S)AR§g, 
does x(t) + x, as t + 00? 
We will see eventually that some of these questions have very satisfying answers, 
while others remain open. 

EXAMPLE 5.11. In the HIV model presented in Example[.8] the steady states 
of (6.19) are solutions of the polynomial system 

i= Sy. ky TV koTV oT 

0 = keTV — b0T; 

0 = so +k3MV —kaMV — 63M 

0=kaMV — 64M; 

0 = ks5Ti + keM; — O5V. 
In Example [5.8] we assumed that all rate constants are known except for 65. The 
five types of cells T,T;,M,M;,V (the species) and the unknown rate constant ds 
give the steady state variety V°* C R° x R=R*. 

We saw in Example that Gross and Bates use MACAULAY2 to write 
the variety V** as the union of two irreducible components: the main component 
Vin = Vifi fo; fs, fa, fs) with 

fi = 5742 M — 2453 M; — 130500 

fo = 259908 T; — 46607 M7; + 484000065 — 20200500 

fs = 17721T + 46607 M; — 484000065 + 2479500 

fa = 484000 V 65 — 184547 M; + 484000065 — 20200500 
fs = 2453 M,V — 72600 M; + 130500 V, 


and the extinction component 
Ve = V(V, M;, 11M — 250, 7;, 7 — 1000). 


This is a nice example of how ideas from computational algebraic geometry can 
help explain a biological system. <— 


5.1. THE CLASSICAL THEORY OF CHEMICAL REACTIONS 189 


The Creators of Chemical Reaction Network Theory. Before discussing 
some of the classic results of CRNT, let’s say a few words about the people behind 
this theory: 


e Fritz Horn (1927-1978) 
e Roy Jackson (1931-) 
e Martin Feinberg (1942-) 


Among their many papers on CRNT, most written between 1972 and 1995, are 
147\/211}/213]. Horn, Jackson and Feinberg all came from chemical engineering, 
where a basic object is a stirred tank: 


Cooling jacket 


(5.20) Baffle 


Mixed product 


The cooling jacket is important, since reaction rates can vary with temperature but 
we want the reaction rates & to be constant. The arrows labelled Feed and Mixed 
product allow reactants to be added and products to be removed. This corresponds 
to reactions § —> A, B —> 0 in the network. 

There are several excellent expositions of CRNT, including the 1979 lectures 
of Feinberg (freely available on the internet) and the 2003 lectures [182] and 
2012 survey by Gunawardena. The latter has a nice discussion of the Matrix 
Tree Theorem, which will play a significant role in Section 


Two Classic Results of CRNT. The remainder of this section will be devoted 
to a discussion of two cornerstones of CRNT, the Deficiency Zero Theorem and the 
Balanced Dynamics Theorem. Here is a stripped-down version of the former. 


THEOREM 5.12 (Deficiency Zero). A chemical reaction network is complex 
balanced for every set of positive reaction rates K if and only if it has defictency 
zero and is weakly reversible. 


Once we explain “complex balanced” and “weakly reversible,” we will be able 
to state the Balanced Dynamics Theorem. After defining “deficiency”, we will be 
ready for the full version of the Deficiency Zero Theorem. 
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Complex Balanced Networks. A chemical reaction network (G,«,Y) leads to 
two sorts of objects: 


e Steady states x, of the ODE system satisfy the equation YA, ®(x,) = 0. 

e Balancing equations of weighted directed graph (G, «) have coefficient matrix 
A,, by (6.14). These are the complex balancing equations since the vertices 
of G correspond to complexes. 


A steady state x, is complex balanced when ®(z,.) satisfies the complex balancing 
equations, i.e., A,®(a,) = 0. 

The following example shows that a complex balanced steady state need not 
use the whole network. 


EXAMPLE 5.13. Consider the reaction network with complexes C1, C2, C3, C4: 
R12 K23 K34 
Cie Ce — Cy Ca 


(5.21) 
; a 
strong linkage class terminal 
strong linkage class 


For the moment, ignore the labels in the diagram. The first two complex balancing 
equations of the weighted directed graph (5.21) are 


inflow Cy outflow Cy inflow C2 outflow C2 
K21Cg = fi2C1 and Ky2Cy = K21C2 + K23Co. 


These equations imply that C; = Cz = 0, which makes sense since the middle 
reaction allows the left two-cycle to drain completely into the right two-cycle. Thus 
any complex balanced solution of (5.21) only uses part of the network. << 


To avoid this behavior, we make the following definition. 


DEFINITION 5.14. A chemical reaction network (G,«,Y) is complex balanced if 
it has a positive steady state solution x, which is complex balanced, i.e., satisfies 
A, ®(«,) =0 with x, > 0. 


The key word in Definition [5.14] is positive, which implies that x, and hence 
®(x,.) = (a',..., 2%") are both positive. Thus a complex balanced network has a 
complex balanced steady state that uses every complex of the network. 


Strong Linkage Classes and Weak Reversibility. The network in is 
connected as an ordinary graph but not as a directed graph. The two-cycles on the 
left and right are its strong connected components or strong linkage classes. The 
latter terminology is common in the CRNT literature. The strong linkage class on 
the left is terminal since nothing flows out of it. This explains the labels in (6.21). 

Given a chemical reaction network (G,«, Y), it is straightforward to show that 
the following conditions are equivalent: 


e Every connected component is strongly connected. 
e Every complex lies in a terminal strong linkage class. 
e For every reaction C; —+ Cj, there is a sequence of reactions 


Cy —> Ch ++ 3 Cn > G. 


DEFINITION 5.15. A chemical reaction network is weakly reversible if it satisfies 
any of the above conditions. 
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The network in Example [5.13] is not weakly reversible. In contrast, consider 
the network in Example 5.4} 


A+B 


The reaction A +B +> C is reversible via C “> A +B, while D “> A+B is 
weakly reversible via A + B “2+ C “3s D. The story for C “> D is similar, so 
the whole network is clearly weakly reversible. 

Here is an important recent result about weakly reversible systems proved by 
Boros in 2019 [37]. When (0) is positive, it is customary to call (x(0) +S) R& 
a positive stoichiometric compatibility class. 


THEOREM 5.16. If (G,«%,Y) is a weakly reversible chemical reaction network, 
then every positive stoichiometric compatibility class has a positive steady state. 


Dynamics of Complex Balanced Networks. The existence of a single positive 
complex balanced steady state has remarkable implications for the dynamics of the 
system. This is the Balanced Dynamics Theorem. 


THEOREM 5.17 (Balanced Dynamics). Let (G,«,Y) be a complex balanced 
chemical reaction network. Then: 


(1) The network is weakly reversible. 

(2) Every positive stoichiometric compatibility class has a unique positive steady 
state. 

(3) Every positive steady state is complex balanced and locally attracting within its 
stoichiometric compatibility class, 


ProoF. For (1), suppose that a positive steady state x, satisfies A,,®(x.) = 0. 
Since the reaction rates « are positive, one can prove that ker(A,,) is supported 
on the terminal strong linkage classes. (In Lecture 4], Feinberg notes that 
this can be derived from the Perron-Frobenius Theorem. Self-contained proofs 
can be found in and Theorem 4.2].) The positivity of x, implies that 
®(x,) € ker(A,) is positive. It follows that every complex belongs to a terminal 
strong linkage class, proving weak reversibility. 

Proofs of (2) and (3) can be found in the 1972 paper of Horn and Jackson ; 
We recommend Lectures 4 and 5] and Theorem 6.4]. 


This powerful theorem tells us that the existence of a positive complex balanced 
steady state implies that all positive steady states are complex balanced. The name 
Balanced Dynamics Theorem is not standard and will change to Toric Dynamics 
Theorem in Section [5.2] 

Theorem [5.17] also tells us that positive steady states are unique and locally 
attracting within their stoichiometric compatibility classes. Are they globally at- 
tracting? Jackson and Horn thought so in 1972, but later realized that their proof 
only implies that positive steady states are locally attracting. In 1974, Horn 
formulated the Global Attractor Conjecture: in a complex balanced network, every 
positive steady state is globally attracting in its stoichiometric compatibility class. 
We will say more about this conjecture in Section [5.2] 
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One thing to keep in mind is that the existence of a positive complex balanced 
steady state may depend on the reaction rates. Here is an example. 
EXAMPLE 5.18. Consider the network: 
y A+A a 


B+B—=A+B 


K23 


In Section [5.2] we will see that this network is complex balanced if and only if 
(K21K31+K93K31+K32K21)(K13K23+K12K23+K21K13) = (Ky2K32+K13K39+K31K12)>- 


When this is satisfied, Theorem describes the dynamics of the system. < 


The Deficiency of a Network. The deficiency is a numerical invariant of a 
reaction network introduced by Feinberg in 1972 [147]. 


DEFINITION 5.19. The deficiency of (G,«,Y) is 
d6=m-—L-s, 


where 


e m= number of complexes = number of vertices of G. 
e €= number of connected components of the underlying ordinary graph. 
e s = dimension of the stoichiometric subspace S = Span(y; — y; | ij € E). 


Feinberg proved 6 > 0 by showing that 
(5.22) 6=m-——s=dim(ker(Y) NM im(A,)). 


In the weakly reversible case, we will give an algebro-geometric proof of 6 > 0 by 
interpreting 6 as a codimension in Section 

The definition of 5 involves only G and Y, yet by (6.22), it tells us something 
about A,. Here an easy example of how to exploit this. 


LEMMA 5.20. If (G,«,Y) has deficiency zero, then for any positive kK, every 
positive steady state is complex balanced. 


ProoFr. A positive steady state satisfies Y A,,®(x,) = 0, so that 
A,,®(a.) € ker(Y) Mim(A,) = {0}, 
where the last equality follows from (5.22) and 6 = 0. Hence A, ®(x.) = 0. 


The Deficiency Zero Theorem.. Here is the full version of the Deficiency Zero 
Theorem, proved by Horn in 1972 [211]. 


THEOREM 5.21 (Deficiency Zero). A chemical reaction network is comples bal- 
anced for every set of positive reaction rates k if and only if it has deficiency zero 
and is weakly reversible. Furthermore, in a weakly reversible network of deficiency 
zero, we have: 


(1) For any positive rate constants & and any positive stoichiometric compatibility 
class C’, there is a unique positive steady state solution x, € C. 
(2) The solution x, is complex balanced and locally attracting within C. 
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ProoF. If the network is weakly reversible and has deficiency zero, then: 


e Weak reversibility and Theorem.16]imply that every positive stoichiometric 
compatibility class has a positive steady state solution. 

e Deficiency zero and Lemma|5.20/imply that these positive steady states are 
complex balanced. 


It follows that the network is complex balanced for any positive x, and then parts 
(1) and (2) follow immediately from Theorem 5.17] 

Going the other way, Theorem [17] tells us that complex balanced implies 
weakly reversible. Finally, we need to prove that 6 = 0 when the network complex 
balanced for all for positive x. As we explain in Corollary this follows from 
the Toric Deficiency Theorem (Theorem 5.35), to be discussed in Section 


Other proofs of the Deficiency Zero Theorem can be found in and : 
Gatermann and Huber give an interesting proof in [162| Theorem 7.1]. 
Here are two simple examples. 


EXAMPLE 5.22. Consider the following reaction network: 


A+C= =D 


R21 


my fen Y= 


B+E 


oorcocr 
oOoroc;9cjo 


0 
1 
O;. 
0 
1 


This is weakly reversible. We compute the deficiency 6 =m — £— s: 
e m= 3 (complexes) and ¢ = 1 (connected components). 
e S = Span(ye — y1, y1 — Y2, ¥3 — Y1, Y2 — Ys) = Span(ys — yi, y2 — ys)- 
Thus 6 = 3—1-—2=0, so Theorem [5.21] applies. < 


EXAMPLE 5.23. In a similar way, one computes that 6 = 1 in Example[5.18] so 
the network cannot be complex balanced for all « by Theorem 5.21] In Section 
we will see that 6 = 1 implies that a single equation determines when « is complex 
balanced, just as we claimed in Example 5.18] < 


Final Comments. This section introduced chemical reaction networks and their 
associated systems of ODEs. We learned a little chemistry and saw a range of 
applications. There was also some nice mathematics, including a bit of algebraic 
geometry (e.g., positive steady states form a semialgebraic variety). In the next 
section, we will look more deeply into the combinatorics and algebraic geometry of 
chemical reaction networks, including the Matrix Tree Theorem and toric varieties. 


5.2. Toric Dynamical Systems 


In 2009, Craciun, Dickenstein, Shiu and Sturmfels proposed the name 
toric dynamical system for differential equations of the form 


dx 
that are complex balanced, i.e., there is a positive steady state x, that satisfies the 
complex balancing equations A,,®(z,) = 0. We use “toric dynamical system” for 


both the differential equations and the underlying reaction network (G,«,Y). 
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In this setting, the Balanced Dynamics Theorem (Theorem [5.17) becomes the 
Toric Dynamics Theorem: 


THEOREM 5.24 (Toric Dynamics). Let (G,«,Y) be a toric dynamical system. 
Then: 


(1) Every connected component of G is strongly connected (weak reversibility). 

(2) Every positive stoichiometric compatibility class has a unique positive steady 
state. 

(3) Every positive steady state is complex balanced and locally attracting within its 
stoichiometric compatibility class, 


The reason for changing to “toric dynamical system” and “Toric Dynamics 
Theorem” is the deep connection to toric varieties, which will emerge later in the 
section. 


Example of a Toric Dynamical System. Before jumping into the general 
theory, let’s study an example in detail. 


EXAMPLE 5.25. Consider the following network: 


complex 1 J x en iG 
A+A 
(5.23) ze _ mt 2 Te, 2 N. 2 
B+B A+B 
complex 3 complex 2 3 L. 3 may 8 TAN 


For the moment, ignore the right-hand side of (6.23). For this network, we have 


—Kh12 — K13 R21 K31 
Ay = K12 —K21 — K23 K32 ; 
K13 K23 —K31 — K32 


and one computes that the kernel of A, is spanned by (Ky, K2, K3), where 


Ay = K21K31 + K93K31 + K32K21 


(5.24) Ko = K12K32 + Ki3k32 + 31k 12 


K3 = &13K23 + K12K23 + K21K 13. 


The kK; have a lovely combinatorial structure: each K; is determined by the spanning 
trees with a unique sink at i, as shown on the right-hand side of (6.23). We will 
see below that this is a special case of the Matrix Tree Theorem. 

As noted in Example [5.23] the deficiency is 6 = m—&—s = 1. Here are the 
details: 


e m= 3 (complexes) and ¢ = 1 (connected components). 
2 1 0 : 1 
eY= F l | => S= Span(y;—y; | 1 #4) = Span (Ei = s= 1. 


Since this network is weakly reversible, the Deficiency Zero Theorem tells us that 
not all positive rate constants & give a toric dynamical system. To find those that 
do, suppose that x, = (21,22) is positive and satisfies A,®(a,) = 0. Using the 


5.2. TORIC DYNAMICAL SYSTEMS 195 


above description of ker(A,,), we obtain 


2 
Ty ky 
(5.25) ®(x,) € ker(A,) => |2iz2| =A |K2| — Ki K3 = K? 
2 
v5 Kz 


since A 4 0 by the positivity of x,. Later in the section we will use the Toric Moduli 
Theorem (Theorem [5.34) to prove the converse, so that we have a toric dynamical 
system if and only if Kj K3 = K3, as claimed in Example[i.18 

To complete the example, let’s look more deeply into the algebra of what we 
just did. From (5.25), we get the system of equations 

core _ AK, Eyl = AKa, xr = AK3 

in variables 1,22, K,, Ko, K3,, Eliminating , we get the ideal 
(Kox? = Ky 2122, K32122 — Kox?, K327 = K,23, K323 = Kori 29, o3(K,K3 = K3)). 
Note that the first three generators are K,x¥i — Kjx% for 1 > j, which suggests 
that something systematic is going on here. 

Since we want x. = (1,22) to be positive, they should both be nonzero. Hence 
it makes sense to saturate with respect to 71472. Doing so gives the ideal 

To = (Koa1 = Ky 22, K321 = Kx, Kk, K3 = geo 
We will see below that Tg is a toric ideal. Then the equation K,K3 = K3 we first 
saw in Example arises naturally from the elimination ideal 
Me =Te NQ[RK1, Ko, K3] = (K,K3 — K3). 

This is another toric ideal. <— 


Changing Variables. In terms of «, the equation K,K3 = K} becomes 
(Ko1k31+423K31+K32K21)(Kigka3t+h12K23+K21K13) = (Ki2k32+h13h32+K31K12) >, 
which defines a complicated variety. The toric nature of this example emerges only 

after we make the change of variables given by (5.24). 


The phenomenon of “toric emergence” happens in other contexts and in par- 
ticular occurs in an example from algebraic statistics that appeared in Chapter 2. 


EXAMPLE 5.26. In Example [2.17] we considered a phylogenetic model from 
and in observed that the probabilities p;;x, i,j,k € {0,1}, satisfy the 
obvious linear equation pooo +: -+p111 = 1 together with three quadratic equations 
and many inequalities. Let’s focus on the quadratic polynomials, one of which is 


Po01P010+P001P100 —PoooPo11 —PoooP101 +P100P111 —P101P110 +Po010P111—Po11P110- 


The other two are similar. These polynomials are not overly complicated but, as 
explained by Sturmfels and Sullivant in Example 3], they become even nicer 
via the change of coordinates 
1 
Pijk = ¥ eS le aad ae 
r,s,t=0 

This can be regarded as a finite Fourier transform. In the new coordinates, the 
three quadratic polynomials transform to 


40019110 — 40004111; 40104101 — 4o00@111;, 410049011 — 4o004111- 
These are not only simpler but also define a toric variety! <— 
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In the situation of Example 5.25) changing from k12, K21, K34, K43, K45,K54 to 
Ky, Ko, K3 gives the toric equation K,K3 = K%. To do this change of variables in 
general, we need the Matrix Tree Theorem. 


The Matrix Tree Theorem. Toric dynamical systems are weakly reversible 
by Theorem [5.24] For this reason, in the rest of the section we will only consider 
networks (G,«, Y) where G is weakly reversible. Hence every connected component 
of G is strongly connected. Let G1,...,Gz be the connected components of G. We 
also assume that the the vertices of G are the set [m] = {1,...,m}. 

In this situation, for 7 € [m], define 


(5.26) k=) x 
where: ae 

e The sum is over all maximal subtrees T of G with a unique sink at 7. We 

denote this by writing TJi. 

e «” is the product of rate constants corresponding to edges of T. 
When the vertex 7 lies in the connected component G,, a maximal subtree of G 
containing 7 is simply a spanning tree of G,. So the above sum is over all spanning 
trees of G, that have a unique sink at 7. We have already seen an example of K; 
in (5.23) and (5.24). 

Recall that A, = —L,, where L, is the graph Laplacian defined in (5.12). 
Using the K;, we can describe the kernel of A, as follows. 


THEOREM 5.27 (Matrix Tree). If G is a weakly reversible directed graph with 
connected components G,...,Ge and positive edge weights kK = (Ki+;), then: 
(1) dim(ker(A,,)) = @. 
(2) A basis of ker(A,,) C R™ is given by 


Vp = ‘> Kei, B= Up aay 
i vertex of G, 
where K; is defined in (5.26) and {e1,...,€m} is the standard basis of R™. 
(3) If G is connected and B,, is the (m—1) x m matrix obtained by removing the 
last row of Ay, then Ky,...,Km are the signed maximal minors of By. 


PRooF. For (1) and (2), observe that after reordering the vertices, A,, = Ac, 
becomes a block matrix whose diagonal blocks are Ag, .,, v = 1,...,¢, with 0’s 
elsewhere. So we may assume that G is connected and thus strongly connected. 
In this situation, the trees T’ in the definition of K; are spanning trees of G with 
a unique sink at 7, and (1) and (2) reduce to the assertion that (Ky,...,Km) 
spans ker(A,,). This follows from Lemma 1 and Theorem 1 in 9.13.4], which 
includes a nice discussion of the Matrix Tree Theorem and references to the proof. 
For another proof, see Theorem 3.2]. 

For (3), let B’ be obtained from B, by removing column i. We need to prove 
that K; = (—1)’~' det(Bt). Theorem 3 of implies that Kj; = det(L':*), where 
the superscript “i,2” denotes the matrix obtained by deleting row i and column i 
of L,,. Since the column sums of L, are 0, it follows that 


Kj = det(Ly") = (-1)'"™ det(L2) 
= (-1)'-™(-1)™"1 det(Ai) = (-1)'1 det (Bi), 


since L, = —A, and B, is obtained from A, by removing row m (the last row). 
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EXAMPLE 5.28. Removing the last row of the matrix A, in Example[5.25] gives 


B= —K12 — 13 K21 K31 
<= 
K412 —K21 — 823 N32 
and one computes without difficulty that the signed maximal minors are precisely 
Ky, Ko, K3 from (6.24). CP 


Ideals, Varieties and Chemistry. This is the title of Section 2 of [113], which 
introduces the algebraic language used to describe toric dynamical systems. Given 
(G,«,Y) with G = ([m], E) weakly reversible, we have as usual the ODE system 


d 
op =YA,S 
di 2) 


for concentrations x = (a 1,...,2,)'. We regard the rate constants K;; for i>j € E 
as variables, giving the polynomial ring Q[x, K] = Q[21,...,%n, Ki; | ij € E]. 

For a toric dynamical system, we seek positive solutions of A,(®(x)) = 0. 
Algebraically, the closest we can come to “x; > 0” is “x; 4 0”. As in Example[5.25] 
this means saturating with respect x, ---%,. Hence we get the ideal 


Ce = (A, ®(a)) : (1 +++ tn) C Qa, «]. 
If V5o(-) denotes solutions whose coordinates lie in Ryo, then the semialgebraic 
variety 
V>0(Ca) C R&q x RE 
has the following lovely interpretation: if (v.,«.) € R&, x R£o, then 


(tx, Kx) € Wso(Ca) <> (G,k.,Y) is a toric dynamical system and 


(5.27) x, is a positive complex balanced steady state. 


In other words, (xx, 4) € Wso(Cg) means that x, is a “witness” or “certificate” 
proving that (G,«., Y) is a toric dynamical system. 

Following [113], we next describe some interesting elements of Cg. Suppose 
that (x,,k«) € Wso(Cg). Then let K, = (Ky1,..., Kam) € R& be obtained by 
evaluating the Kj; at k.. Since A,, (®(x,)) = 0, we have 


L 
P(x.) = ye Av ie Kypek 
v=1 k vertex of G, 
by the Matrix Tree Theorem. In particular, if i, 7 are vertices of the same connected 
component G, of G, then 


=> K,,0%) — K,;04' =0. 
at nes) ail x agly. 


Thus the polynomial K;x”i — K;2% vanishes on Vso(Ca). The intrinsic reasoning 
for this vanishing is given by the following lemma. 


LEMMA 5.29. Kyx4i — Kjx24' € Cg wheni,j are vertices in the same connected 
component of G. 


PROOF. Following the proof of Theorem |5.27| we can assume G is connected. 
If we set Ay (®(x)) = (uy,.--,Um)*, then uy,...,Um € Ca. 

For simplicity, we will prove K,x2¥?2 — Kor¥1 © Cg. As in Theorem [5.27] we 
remove the last row of A, to get an (m— 1) x m matrix B,. Thus B,(®(«)) = 
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(U1,---,;Um—1)*. Let wi,...,Wm be columns of By, i-e., By = [wi-+:Wm]. Given 
any row vector D = (D,,...,Dm-—1), one computes that 
Dyuy +--+ + Dy-1iUm-1 = D(B,®(x)) = (DB,,)®(a) 
= (D-wy)a! + +--+ (D> wy). 


Now let Dy,...,Dm-—1 be the signed maximal minors of the (m — 2) x (m— 1) 
matrix [w3--:Wm] obtained by deleting first two columns of B,. Expanding by 
minors along the first column shows that 


det[w w3:::W,] = D-w 
for any column vector w. Then the above formula simplifies to 
Dyuy +--+ + Dm—1Um-—1 = det|wi w3 +++ Wm)x! + det[w2 w3 +--+ Wm])x”? 
= det(B?)2™ + det(Bl)2” = —Koz™ + Kix, 
where the last line uses AK; = (—1)*~1 det(B*) from the proof of Theorem [5.27] 


A Change of Variables and a New Ideal. Once we regard the rate constants 
kK as variables, the K; give the subring 
Q[K] = QiKy,..., Km] © Qa]. 
By Lemma 5 of [113], the K; are algebraically independent over Q. Hence we can 
regard Q|K] as a polynomial ring. This leads to the ideal 
Te = (Kya = Kaa 


in the polynomial ring Q[z,K]. This is a toric ideal (prime and generated by 
binomials), as proved in Proposition 6]. A proof using multi-Rees algebras 
can be found in Theorem 3.3]. 

The following basic result explains the relevance of the toric ideal Tg. 


[oe} 


i,j in same connected component of G’) : (a1 --- an) 


PROPOSITION 5.30. Given (x, Kx) € R&o x R&o, the following are equivalent: 


(1) (G,«,Y) is a toric dynamical system and x, is a positive complex balanced 
steady state. 

(2) (t4, 4%) € V>0(Ca). 

(3) (t«, Kx) € Vs0(Ta). 


PROOF. The equivalance (1) = (2) is (27), and (2) => (3) follows from the 
discussion leading up to Lemma .29] For (3) = (1), note that K,; and x%' are 
positive for all 7. By the definition of Tg, the ratio xf’ /K; depends only on the 
connected component of G containing 7. If we set 

Yi 


x 
At = Kn i any vertex of G), 


then 


¢ ¢ 
O(x,) = S- S- cee, = > Av s Kyrek, 


v=1k vertex of G, v=1 k vertex of Gy 
which is in ker(A,,,) by the Matrix Tree Theorem. Thus 2, is complex balanced 
for the rate constants «,, proving that (G,«,,Y) is a toric dynamical system. 


Using the change of coordinates from « to K, we now have toric ideal that tells 
us exactly when we have a toric dynamical system with a witness. 
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EXAMPLE 5.31. In the situation of Example [5.25] the ideal Te C Q(z, K] = 
Qlx1, v2, Ky, Ko, K3] is 
Ta = (Kou? — K12 22, K3x? — Ky, 22, K3x 122 — Kyx2) : (x1 4223)° 
Using the method explained in Theorem 14 of Chapter 4, §4], we obtain 
Tg = (Kon — Ky x2, K3x1 — Kox2, Ki K3 — K3), 
exactly as claimed in Example [5.25] < 


Moduli of Toric Dynamical Systems. The next step is to get rid of the witness 
x,. In other words, what conditions must «., satisfy in order for x, to exist? 


EXAMPLE 5.32. Let’s see what happens in Example[.31] Suppose k. satisfies 

Kui Ky3 = K2,. We know from Example [5.25] that the stoichiometric subspace S' is 
defined by 7; + x2 = 0. Given c > 0, the point 

Ly = ef Kut Ky2 ) €R2, 

Kui + Ka2) Kai + Ku 

lies in the stoichiometric compatibility class 71-+-22 = c. More importantly, (a., K.) 

clearly satisfies Ky2%%1 — Ky10.2 = 0, and then Ky32.1 — Ky2%%2 = 0 follows from 

Kui Ku3 = K?,. Thus (xs, Ky) € Vso(Ta), so (G, kx, Y) is complex balanced. This 

proves the claim about complex balancing made in Example 5.18] < 


This example is nice, but we want theorems, not computations. The idea is to 
use elimination theory. Given Tg C Q[z, K], we define the toric moduli ideal 


Me = Te 1 Q[A] € QUA]. 
Note that Mg is an elimination ideal. 
LEMMA 5.33. Me is a toric ideal. 


Proor. We noted above that Tg is a toric ideal. Thus Mg is prime since T¢g is 
prime. Applying the Buchberger algorithm to the binomial generators of Tg shows 
that Tg has a binomial lex Grébner basis. Hence Mg has binomial generators by 
the Elimination Theorem. 


The key property of Mg is that it defines the moduli space of toric dynamical 
systems for G and Y. 


THEOREM 5.34 (Toric Moduli). Assume G is weakly reversible. For kx € R&o, 
(G,k.,Y) is a toric dynamical system if and only if Ky € Vso(Ma). 


Proor. This is Theorem 7 from [113]. The intersection Mg = Tg 1 Q|K] 
gives a commutative diagram of inclusions and projections: 


Vs0(Ta)— R89 x R& 


le | 


V30(Me) > R&S, 


with an analogous diagram over C. 

If (G,Kx,Y) is a toric dynamical system, then by Lemma [5.30] there is zx, 
such that (#,,K,) € Wso(Tc). Thus K, = 1(x,,K.) € Vso(Mc). For the other 
direction, assume k, gives K, € Wso(Ma). We need to prove the existence of a 
witness x, with (x,,K.) € Wso(T¢). If we work over the complex numbers, the 
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Closure Theorem (Theorem 3 of [104] Chapter 3, §2]) implies that 7 has Zariski 
dense image, i.e., 


(5.28) m™(Vc(Ta)) = Ve(Me). 
Since Tg and Mg = Tg N Q|K] are toric ideals, the projection map 
m: Vc(Te) + Vc(Ma) 


is a toric morphism, which induces a group homomorphism of the associated com- 
plex tori. The image of this map is Zariski dense by and is a subtorus by the 
theory of algebraic groups. Thus the map on tori is surjective. 

Since Kk, € Vso(Me) lies in the torus of Vc(Mc), it follows K, is the image 
of a point in the torus of Wc(Tc), which we write as (21,...,2n, Kx) € Wc(Ta) 
with 21,...,2, € C*. We have a complex witness! Since K, is positive and Tg is 
a binomial ideal, it follows that x. = (|z1|,...,|Zn|) satisfies (7, A.) € Wso(Te). 
Thus x, is the desired witness. 


The proof of Theorem[5.34]is a lovely combination of classical elimination theory 
(which works over C) and toric geometry (which enables us to translate from C to 
Rso). The statement of the theorem reminds us that we always assume that G is 
weakly reversible, since otherwise (G,«.,Y) is never complex balanced. 

Our final result from concerns the deficiency. 


THEOREM 5.35 (Toric Deficiency). If G is weakly reversible, then the deficiency 
6 is the codimension of Vc(Mag) C C™. 


We refer the reader to [113| Theorem 9] for the proof. Here is a nice corollary. 


COROLLARY 5.36. If (G,«%,Y) is complex balanced for all positive K, then it 
has deficiency zero. 


Proor. The polynomials K1,...,Km € Q{x] define a map K : C® + C™. 
Our hypothesis and Theorem [5.34] imply K(R£9) C Vso(Mc), so that we have 
inclusions 


K(R£5) © Vs0(Me) © Ve(Mag) € C™. 


Since RZ is Zariski dense in C”, we obtain 
K(C*) = K(R&,) C K(REq) C Ve(Ma) € C™. 


But K is dominating since Ky,..., Ky are algebraically independent over Q. This 
forces Vc(Mg) = C™, hence 6 = 0. 


This corollary can also be proved using the moment map and the Cayley poly- 
tope of G. See the discussion following Theorem 9 in [113]. 


The Work of Karin Gatermann. Gatermann wrote four influential papers that 
introduced toric methods into the study of chemical reaction networks: 


161) Counting stable solutions of sparse polynomial systems in chemistry, 2001. 

A family of sparse polynomial systems arising in chemical reaction systems 
(with B. Huber), 2002. 

163) Bernstein’s second theorem and Viro’s method for sparse polynomial systems 
in chemistry (with M. Wolfrum), 2005. 

164| Toric ideals and graph theory to analyze Hopf bifurcations in mass action 
systems (with M. Eiswirth and A. Sensse), 2005. 
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These papers inspired the authors of to carry forward her ideas after her early 
death. In a MathSciNet review of her last paper [164], Maurice Rojas wrote: 
Karin Gatermann was a pioneer in computational algebra and extending its ap- 
plications to differential equations, dynamical systems, and chemistry. Tragically, 
she succumbed to cancer (at a young age) on New Year’s Day 2005. She is sorely 
missed by her colleagues. 
The introduction to Gatermann’s 2002 paper with Huber provides good sense of 
what she was hoping to accomplish: 
The study of positive solutions by chemists and applied mathematicians is com- 
pletely independent of the literature on sparse polynomial systems in algebraic 
geometry. ... The aim of this paper is to build a bridge between the two ar- 
eas, the applied mathematics literature and the algebraic literature on sparse 
polynomial systems. 
As an example of the success of this bridge, we note that the proof of Theorem [5.35] 
in is a variation of the Cayley trick argument that Gatermann and Wolfrum 
use in in their study of complex balanced systems. 
To better understand of the depth of Gatermann’s contribution, we will look 
at an example proposed by Edelstein in 1970 [137]. We will explore this example 
from three perspectives: 


1979: Classical tools of CRNT, based on Feinberg [147]. 
1989: Symbolic computation, based on Melenk, Méller and Neun [269]. 
2005: Toric methods, based on Gatermann and Wolfrum [163]. 


Edelstein 1970 and Feinberg 1979. In 1970, Edelstein proposed a bio- 
chemical model 


(5.29) A+L=—2A A+B=—=C=B+M 


where L and M are kept at fixed concentrations. Edelstein notes that this system 
can have multiple steady states and is analytically simple. He also comments that 
it has “not been biochemically demonstrated.” 

In his 1979 lectures, Feinberg Lecture 2] notes that it is easier to work 
with 


K12 


(5.30) A= 2A A+B=c=_B 
K21 KA3 Koa 


where the concentrations of L and M have been absorbed into kj2 and kK54. Thus 
(5.30) is a system which models the behavior of (6.29). In practice (including this 
book), is often attributed to Edelstein, though in reality it is Feinberg’s 
modification of Edelstein’s original network (6.29). 

The network (5.30) is an example of what Feinberg calls a “funny” reaction 
network. At the beginning of his discussion of these networks, he says 

. we shall admit for consideration networks containing peculiar reactions like 

A —> 2A or 0 —> A (zero reacts to A) which, at first glance, appear to be 

incompatible with conservation of matter. 2-20] 
Such reactions enable mass-action networks to model stirred tanks (6.20), which 
are “open” systems because reactants are added and products are removed. 

One reason Edelstein and Feinberg like this example is that it has different 
dynamical behavior for different values of the reaction rates k. We will see below 
that when K21k43K54 = K12K34K45, (5.30) is a toric dynamical system with one 
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steady state in each positive stoichiometric compatibility class. But for the reaction 
rates 


(5.31) A224 A+B—=2=C==B 
1 1 0.2 


from [148] 3.1], the behavior is more varied. The number of steady states in a given 
stoichiometric compatibility class can vary from 1 to 3, as shown here: 


(5.32) 


We will say more about this picture below. 


CRNT Analysis of Edelstein’s Example. For (5.30), we have 


i220 0 i) + 
(5.33) Y=|0 0101} S=Span{ jo|,| 1 
00010 o| |-1 


so for c > 0, the stoichiometric compatibility class is the intersection of x2+23 =c 
with the positive orthant, as shown in (5.32). Note that this is an example of an 
unbounded stoichiometric compatibility class. We also have the ODEs 


dx, 2 
a Ky1201 — Kq12] — K34%1X2 + K43X3 
dx2 

(5.34) Te = —K34%1%2 + K43X%3 + K45L3 — Ka 
dx3 
a K3401 22 — K4gt3 — K45X3 + K54LQ. 


Note that (5.30) is weakly reversible. Our methods make it easy to determine 
when we get a toric dynamical system. By looking the maximal trees in (6.30) with 
a unique sink at complex 1, 2, 3, 4 or 5, one gets 


Ky = ka1, Ko = kg, Kg = Kagh54, Ka = Kgahi54, Ks = K3akas. 
Now we compute some ideals: 
Tg = (K,27—Kox,, K3r3—K421 20, K3t9—K52 22, K4x2—Ksx3) : (41"203)™ 
= (Kk, K3 — KoK5, Kara — K5x3, 5x1 — K3, Kr — Ko) 
Me =Te AN QA), Ko, K3, K4, Ks] = (Ky K3 — K2Ks) 


By Theorem [5.34] it follows that (5.34) is a toric dynamical system if and only if 
ky K3 => Kooks, or equivalently, K21443Kh54 = 412Kh344h45.- 
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Note also that the reaction rates that give a toric dynamical system are defined 
by a single equation and hence have codimension one. Since the deficiency of (5.30) 
isd =m—€—s=5—2-—2=1, this is consistent with Theorem [5.35] 


Melenk, Moller and Neun 1989. In their paper Symbolic solution of large sta- 
tionary chemical kinetics problems [269], Melenk, Moller and Neun apply Grébner 
basis methods to chemical reaction networks. They study Edelstein’s example 
(6.30), though an unfortunate typographical error mars their analysis (the formula 
for p3 in 5.2] should have c76, not ¢1{). 

We instead do our own Grobner basis analysis of (.30). In the stoichiometric 
compatibility class 72 +23 =, tells us that the steady states are defined by 


2 
K12@1 + K43%3 — Kq12] — K3421 LQ = 0 


K43h3 + K45%3 — K3401L2 — K5aLq = 0 
%2+%3=C. 
(We ignore the third line of (6.34) since it is the negative of the second.). We want K 


and c to be arbitrary, which means computing a lex Grébner basis for x3 > ©2 > x1 
over Q(k,c). This gives the polynomials 


ee: 
gi = (Kas + K54)@2 — KL + K12%1 — CKa5 


g2 = (Kas + K54)03 4 K2104 K12%1 — CKi54 
93 = kork3ar} + (—kygk3a + korka3 + korkas + kotksa)x7 
+ (—kigkag — kigkas + ckgakas — kigksa)a1 — ckagksa. 


Since g3 is cubic in x1, there are 1, 2 or 3 values for x1. The polynomials g, and gz 
enable us to solve for x2 and x3 in terms of x. It follows that there are potentially 
1, 2 or 3 steady states in each stoichiometric compatibility class. 

The discriminant D of g3 is a (large) cubic polynomial in c. There are three 
real roots when D > 0 and one real root when D < 0. The coefficient of c? in D is 
—AKo1K3,K Fs < 0, hence 


(5.35) c>0 = D<0 => gz has only one real root. 


From 21, we get x2 and 23 using g; and go. It follows that the plane v2 +23 =c 
contains exactly one real steady state when c > 0. But it is positive? This is 
where Theorem |5.16]comes into play, for it guarantees the existence of at least one 
positive steady state satisfying 72 + 23 = c when c > 0. Hence this positive steady 
state is unique when c > 0, exactly as predicted in the picture (6.32). 

For the specific reaction rates in (6.31), the Grobner basis becomes 


gi = 1.242 — x? + 8.524, —c 
go = 1.2a3 + x7 — 8.541 — .2c 
g3 = 23 — 6.3a? + (c— 18.7) x1 — .2c. 


Thinking of g3 as a polynomial in x, and c, we get the picture (6.36) on the next 
page. The discriminant D of g3 has roots c = 10.951, 29.777, 30.695 in this case, 
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which when combined with the picture implies 
c< 10.951 == gs has three real roots, two of which are negative 
10.951 < c < 29.777 ==> gz has one real root, which is positive 
29.777 < c < 30.695 = > gs has three real roots, all positive 
30.695 <¢ = > gz has one real root, which is positive. 


You can see this clearly in the picture (5.36). 


(5.36) 5 g3 =0 — 
ZL 
0 
LZ 
0 10 20 30 40 


A cylindrical algebraic decomposition of the system 

MN = 92 =93 =9, c> 0, 4% > 0, r2 > 0, %3 > 0 
is easily done in MATHEMATICA using the CylindricalDecomposition command 
and reveals that when 2; and c are positive, so are x2 and x3. Hence we recover 


the full picture shown in (5.32), and we now know that multistationarity occurs in 
a narrow range of stoichiometric compatibility classes around c = 30. 


Gatermann and Wolfrum 2005. Before we can discuss what Gatermann and 
Wolfrum say about Edelstein’s example in [163], we need to explain the framework 
of their paper. One difference is that unlike and [163], we find it convenient 
to regard & = (K,;;) as an element of R&o rather than a vector of variables. 

Let’s begin with the structure of the matrix A,. Suppose the directed graph 
G has m vertices (the complexes) and r edges (the reactions). If we enumerate the 
edges, then G has an m x r adjacency matrix I, with rows indexed by complexes 
and columns indexed by reactions. The entries of J, are 


—1 in row 7 and column i- 7 
1 in row 7 and column ij 
0 otherwise. 


The reaction rates K = (Ki;) € R&o give the r x m weight matrix I,,, with rows 
indexed by reactions and columns indexed by complexes. The entries of [,, are 


Kjj in row iJ and column 7 
0 otherwise. 


Thus the ith column of J,, records the reaction rates of all reactions that start from 
complex 27. 
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The key property of these two matrices is that their product is A,, i.e., 


Ag = Iglu: 
This enables us to relate the two representations of the ODE system: 
e ~YA,®(2) = Y (InIq)®(z) 
= (¥Ia)(Ix®(2)) 
inj 
= [ Yj — Yi | Kyo IJ 


inj 
Notice also that the columns of YJ, span the stoichiometric subspace S. 


EXAMPLE 5.37. For the Edelstein example 


K12 K34 Ka5 
R21 KA R54 


one easily computes that 


—Ki2 Ka 0 0 0 
K12 —Ka1 0 0 0 
Ag=| 0 0  —K34 Ka3 0 |= 
0 0) K34.  —Ka3— Kan B54 
0 0 0 KA5 —K54 
4k o & @ aye 7S 
0 KQ1 0 0 0 
1 -1 O 0 0 0 
0 0 K34 0 0 
0 0 -1 1 0 0 
0 0 0 K43 0 
0 0 1 -1 -1 1 
‘& o& 2 © +1 al |? © & = 8 
\ y 0 0 0 0 Ki54 
a 
Le ra 
The fourth column of J,, shows that two reactions start from C. This will have an 
interesting consequence. <— 


Three Cones. Polyhedral cones have been part of CRNT for a long time. A basic 
reference is the 1980 paper of Clarke [79]. Three cones are especially relevant. 

The first is the orthant R&,). A positive steady state x, lies in the interior 
IR&, of this cone. The second cone is more interesting and comes from the Matrix 
Tree Theorem. When G is weakly reversible, Theorem [5.27] gives an explicit basis 
v1,...,ve of ker(A,) C R™ that is constructed from the partition of Ky,..., Kim 
induced by the € connected components of G. 


COROLLARY 5.38. When G is weakly reversible, the basis elements v1,...,V¢ 
of ker(A,) GC R™ are minimal generators of the cone ker(A,) NR. 


PROOF. For a connected component G, of G, vy = Yo; vortex of CG. Kje;. The 
description of K; given in (6.26) implies K; > 0, so that v, € ker(A,) A R&y. Since 
the v, involve disjoints sets of coordinates and span ker(A,), it follows without 
difficulty that they are minimal generators of the cone. 
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The third cone is ker(Y Ja) MR&o, studied by Clarke in 1980 [79]. Here, R” has 
coordinates z = (z;;);;, indexed by the r reactions of the network, i.e., the edges 
of G. These are the reaction coordinates, sometimes written z = (21,...,2,). In 
[79], Clarke uses ker(YJ,) MN R&, to analyze the stability of steady states. 

To see why the cone ker(YJ,) M R&o is relevant, note that 2, € R&, implies 
I,,®(a,) € R&o. Then for x, € R&,, we have 
x, is a steady state of (G,«K,Y) — > 0=YA,0(z,) 

<— 0= (YI). 8(a)) 
<=> [,,0(x.) € ker(YIa) ARS 
<=> Rsol,®(xs) C ker(YIa) OR§o, 


(5.37) 


where the last equivalence follows because ker(YJ,) 1 R&q is a cone. 


Deformed Toric Varieties. The next task is to characterize the elements of 
ker(Y Ig) R& 9 that correspond to steady states of (G,«, Y) for fixed reaction rates 
K = (Kij)i+7 € R&g. We start with the variety 
V,, = Zariski closure of the image in C” of [,,®(x) = (...,Kij2%,...). 
Ss, 
ij 

We call V,, a deformed toric variety since it is parametrized by monomials multiplied 
by constants. As we vary «, we get a family of varieties V,, C C’, 


The last equivalence of (6.37) suggests using the ray generated by I,,®(xr). We 
capture this idea using a second deformed toric variety: 
V, = Zariski closure of the image in C” of roI,®(x) = (..., Kij202%,...). 
—_—— 
Iw 
To find equations for V,, and V,,, we recast the matrix Y = [y, +++ Ym] € Ze i 
terms of the reaction coordinates. We do this in two ways: 
beset! ij 
vas . es eee 
2 : 2 
where some y;’s are omitted (if no reactions start at i) and others appear multiple 
times (if more than one reaction starts from ‘). 
Using these matrices, it is now easy to find ideals that define the above deformed 
toric varieties. Recall that any a € Z” can be written uniquely as a = at 
where at,a~ € Zso have disjoint support. 


=a; 


PROPOSITION 5.39. Let Viens Ye be as above. 
(1) If gi,.--, 94 1s a basis of ker(Yz : Z” > Z”) CZ’, then V,. = V(Ly,,«) for 


I z\ ot Z\ 9 (epee es 
vom = ((2) (2) |b stst) (ava) 


(2) If hi,...,hz is a basis of ker(Y, :Z" + Z+") CZ’, then V,, = VUIy,..) for 


tox = (5) = (3) 


Proor. (1) When « = 1 = (1,...,1), Vi is the toric variety of Y,, so that by 
Exercise 1.1.3 of [106], Jy, 1 = (298 — 29 1<i<t):(%---2,)™ is the toric ideal 


1<ést): (e--+ap)™. 
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that defines Vi, The coordinate change given by 2; +> Ki+j;Z2i+; takes Vi to Vi, 
and (1) follows easily. The proof of (2) is similar. 


One reason why reaction coordinates are so nice is that changing the reaction 
rates & is done by simply rescaling coordinates. This is what makes the proof of 
Propostion SO easy. 

Let’s see what Proposition [5.39] says about the Edelstein example. 


K Ks Ki rf . 
EXAMPLE 5.40. A — 2A,A+B — C — B has reaction coordinates 
21 43 54 


(2ijseeg RQ) _ (Z1 +2) 221) 234; 24-43, 24-35) 25 »4), 


The matrix Y from (5.33) gives matrices 


121000 » Nee aa 

Y¥,=}]0 0 1 0 0 1] and Y,= 
0001 10 Poe oe: 
00 0 1 1 ~0 


Integer bases of the kernels are 
Yz: {(0,0,0,—1, 1,0), (—1, 1, —1, 0,0, 1), (—2, 1,0, 0, 0, 0)} 
Yr: {(0,0,0, —1, 1,0), (—1, 1, —1,0,0,1)}. 

By Proposition [5.39] the second kernel gives the deformed toric ideal 


25 Z4 22 26 21 23 oo 
Fi = ( a “—) (17-26) 
RA5 R43 K2Q1 K54 R12 K34 


= (K43.25 — 84524, 412K342226 — K21K5421 23) 


which defines the deformed toric variety V,,. The story for Iy,,, is similar. 


The Cone and the Deformed Toric Variety. The cone ker(Y J,)R&v interacts 
with the deformed toric variety V,, in two ways. Here is the first interaction, which 
is a version of Lemma 3.1] (see also the comments following the lemma). 


THEOREM 5.41. Suppose z, € ker(YIg) N RS. 

(1) For any k > 0, (G,K,Y) has a positive steady state x. with I,,®(r.) € Rsoz 
if and only if z. © Vis. 

(2) There exists K > 0 such that z, € V,.. 


PROOF. One direction of (1) is obvious, since [,,®(a,) € Roz. implies there 
is \ > 0 such that z, = \J,®(x,). Then z, € V, by the definition of V,,. 

For the other direction, first observe that when « = 1, Yz; induces a monomial 
map Ryo x R&y > Win IR,» which is easily seen to be surjective by adapting the 
methods used in the proof of Theorem [5.34] 

Given z, € VN ker(YIa)MR&o, we have z./K € Vi R&o, which by the above 
paragraph implies the existence of (A, z..) € Ryo x R&o such that 


Pol Ko Gag AEE ink) 
SYS 
ivy 
This implies z, = AI,,®(x,). Since z, € ker(YJ,) NR&,, G.37) implies that x, is a 


steady state of (G,«,Y). 
For (2), note that z, = xol,,@(@) when & = z%, ) = 1, and x = (1,...,1). 
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The second interaction between V,, and ker(YJa)M R&, involves the minimal 
integral generators {£;} of this strictly convex rational polyhedral cone. Each E; 
lies in the kernel of YJ,, which implies that the E; come in two flavors: 


e [,E; = 0. It is well known that ker(£;) is generated by positive circuits, 
which are directed cycles in G. Being a minimal generator guarantees that 
FE, comes from a minimal positive circuit. 

e [,E; 40. We call E; a stoichiometric generator in this case. 


As explained in the final paragraph of the introduction to [163], stoichiometric 
generators can induce multistationarity. 


EXAMPLE 5.42. In the Edelstein example, the integral minimal generators of 
ker(Y Ia) M R&, are 


Ey = (1,1,0,0,0,0), Ex = (0,0,1,1,0,0), E3 = (0,0,0,0,1,1), 
4 = (1,0, 1,0, 1,0); Es = (0, 1,0, 1,0, 1). 


The first three generators come from the obvious two-cycles in the network shown 
in Example [5.40] The two other satisfy 


Lame. 1H1.01j=-1 


and hence are stoichiometric generators. The vector J, £4 lies in the kernel of Y 
and gives the vector (—1,1,—1,0,0,1) the kernel of Y,. In Example 5.40] we saw 
that this vector was responsible for the polynomial K12k342226 — K214542123 in the 
deformed toric ideal Le <P 


The relation between stoichiometric generators and the deformed toric ideal 
Ty, , 8 explained in Lemma 4.5] and the comments following the lemma. 


Toric Degenerations. We conclude our discussion of the Edelstein example in 
by examining the toric degenerations of its deformed toric ideal. 


EXAMPLE 5.43. For A = 2A,A+B _— @ = B, we saw in Example.40] 
21 AZ 54 


that the homogeneous deformed toric ideal is 
Ty, 4g = (Ka3%5 — 4524, K12K34222%6 — K21K542123). 


This ideal J has four initial ideals corresponding to regular triangulations of the 
convex hull of the columns of Y;. A basic reference is [333) Chapter 8). 
If we write Y = [yi yo y3 ys ys], then Y, has six columns 


Pad Led Lad el Lae 


associated to the reaction variables 21, 22, 23, 24, 25,26. The column containing y4 
gets duplicated since two reactions start from complex 4. 

When taking the convex hull, we can ignore the 1 in each column and work 
with P = Conv(y1, y2, ys, y4; Y4; Ys). Labeling each column with the corresponding 
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reaction coordinate gives the following picture: 


ZA, 25 


We get a regular triangulation by lifting the six vectors into the fourth dimension 
and projecting the lower convex hull into R*. Since y, = (0,0,1) gets lifted twice, 
the lower convex hull picks z4 or z5, depending one which lifting of y4 is lower. Here 
is one of the regular trianglulations: 


25 


(5.38) 


The new edge, shown in red, indicates that the lift of z2z6 lies above the lift of z1 zs. 
This selects z2z6 as the leading term of &12K342226 — K21K542123- 

As explained in Theorem 4.18], each 3-simplex of the regular triangulation 
gives a subsystem of the original system. If we label each simplex with the reaction 
variables of its vertices, we get the subsystems 


21222325 | A= 2A At Rs sR 
21 


Kas 


gi estete 2 A 2A A+B“SC==B 
54 


Let’s explore the dynamics of the top subsystem using the reaction rates from 
oe 8.5 
(6.31). This gives A ao 2A, A+B—+C —>B. To find the steady states, we 


set K12 = 8.5, Koi = h34 = 45 = 1 and K43 = h54 = 0 in (6.34) to obtain 


dx, 
— = 8.52, —a#? —me 
dt 1 1 142 
dxo 4 
—— = —-2412 x 
dt 142 3 
dx3 
= 1172 — @3. 


dt 
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When we focus on the stoichiometric compatibility class x2 + 23 = c for c > 0, we 
get the system of equations 

0 = 8.521 - x? — 2122 

0 = —21% + 4X3 

c=%4+ 23. 
Elimination theory implies that x, satisfies the cubic equation 

xe — 7.527 + (c— 8.5)a1 = 0. 
When c = 22.5, it follows that the steady states satisfying x2 + 73 = 22.5 are 
(41, 2,03) = (0, 22.5,0) or (3.5,5,17.5) or (4, 4.5, 18). 


In the first steady state, some concentrations are zero. The other two are positive, 
which shows that the subsystem selected by the regular triangulation exhibits 
multistationarity. The major result of is Theorem 4.18, which relates the 
behavior of such subsystems to the dynamics of the original system. << 


The idea that the dynamical behavior of a chemical reaction network can be 
influenced by what happens on certain subsystems goes back to Clarke in 1980. 
This is why he was so interested in cone ker(YI,) 1 R&,. Gatermann made the link 
to toric varieties and Grobner bases in her pioneering work. These days, subsystems 
of reaction networks are studied using toric varieties, toric ideals, Grobner bases, 
and matroids. The papers and the references therein give a hint of 
the range and depth of what people are thinking about. 


Something Unexpected. We finish this section by observing that the differential 
equation (5.11) from Section can be written as 


dx 


(5.39) ==> wij (yy —y)- 
dt ae ~~ 

wd weights blending control 

functions points 


The language of blending functions and control points is from geometric modeling, 
as studied in Section 

In 2005, Craciun and Feinberg related the potential for multistationarity 
to the injectivity of the map 


x €R&g+> > nija™ (yy — y) 
i oer a 

for fixed reaction rates « > 0. They write this map as c++ p(c,k) and note that 

If a reaction network has the capacity for multiple positive equilibria, then there 

exists some choice of positive vector ko such that the function c+> p(c, ko) is not 

injective. 
Feinberg is one of the creators of CRNT [147149], and similarly, Craciun is one 
of the creators of toric dynamical systems [113]. So their collaboration makes 
sense. But Craicun also wrote the paper Some geometrical aspects of control points 
for toric patches with Garcia-Puente and Sottile [[15]. This paper uses to 
apply the injectivity theorems of Craciun and Feinberg to geometric 
modeling. The paper also discusses degenerations of toric patches that were 
studied intensely by Garcia-Puente, Sottile and Zhu in [160]. The splendid images 
presented in Example[B.6]are based on [160]. 
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In more recent work [109/110], Craciun introduced the powerful idea of a toric 
differential inclusion to study the Global Attractor Conjecture from Section b.1] 
There is clearly a lot of interesting mathematics going on here! 


Final Comments. In this section, we introduced toric dynamical systems, learned 
about the work of Karin Gatermann, and spent time on a classic example due to 
Edelstein. Although our treatment has been far from complete, we hope to have 
convinced you that this subject is worth learning. 

The next section will explore some of the fascinating phenomena that arise in 
the study of biochemical reaction networks. 


5.3. Algebraic Methods for the 
Study of Biochemical Reaction Networks 


by Alicia Dickenstein 


In this section we will concentrate on biochemical reaction networks of interest 
in systems biology, in particular enzymatic networks, consisting of different types 
of multisite phosphorylation networks. One source of inspiration for our study with 
algebro-geometric tools is the following quote from the abstract of the paper [300): 

Multisite phosphorylation cycles are ubiquitous in cell regulation systems and 

are studied at multiple levels of complexity, from molecules to organisms, with 

the ultimate goal of establishing predictive understanding of the effects of genetic 

and pharmacological perturbations of protein phosphorylation in vivo. Achieving 

this goal is essentially impossible without mathematical models, which provide 

a systematic framework for exploring dynamic interactions of multiple network 

components. 

We will mainly concentrate on recent advances on the determination of mul- 
tistationarity for these networks, whose dynamics are usually modeled with mass- 
action kinetics. For many classes of chemical networks, as the complex balanced 
networks discussed in Sections [5.1] and [5.2] monostationarity is an important prop- 
erty. Instead, for biochemical reaction networks, that is, chemical reaction networks 
modeling pathways in systems biology, multistationarity is a general feature and it 
is important because it is intepreted as a way for the cell to take different decisions. 
Indeed, differential systems with mass-action kinetics are deterministic. But the 
occurrence of multiple stable steady states in the same stoichiometric compatibility 
clas]] implies that trajectories starting from different initial conditions with the 
same conserved quantities can converge to steady states with different properties. 

We will end the chapter with some open questions. We refer the reader to the 
surveys for further details and references. 


Multistationarity. We have seen the definition of multistationarity of a given 
reaction network in Section [5.1] Our point of view will be the following. We 
will fix a basis B of the orthogonal $+ of the stoichiometric subspace S, and we 
call total amounts the linear conservation constants associated to a trajectory 2, 
that is, the values c, of the conserved quantities v- x, for any v € B. Given a 
reaction network, we will consider both the vector « of reaction rate constants and 
the vector c = (Cy)veg of total amounts as parameters. We will thus look at the 
family of systems of differential equations of the form (parametrized by #) 


1Such steady states are said to be stoichiometrically compatible 
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and stoichiometric compatibility classes {v- x = cy,v € B} MR&o. Note that 
So:= {u-u =cy,v € B} = x(0)4+ S, for any «(0) satisfying v - 7(0) = c,, for all 
v © B, as described in (5.16). Recall that we only allow the rate constants « to 


take positive values. 


DEFINITION 5.44. We say that a given reaction network has the capacity of 
multistationarity if there exists a choice of parameters &,c for which the system 
with rate constants « has more than one positive steady state in the stoichiometric 
compatibility class S.MR& . Any such choice of &,c¢ are called multistationarity 
parameters. When there is at most a single positive steady state in each class, the 
reaction network is called monostationary. 


Main Questions About Multistationarity. Given a reaction network, these 
are the main questions about multistationarity: 


(1) Does the network have the capacity of multistationarity? 
(2) If so, 

(a) detect multistationarity parameters. The answer to this query could be 
just one choice of parameters, an open set in the space of parameters 
(called a multistationarity region) or hopefully, all the parameters for 
which multistationarity occurs. 

(b) find the maximum possible number of positive steady states in the same 
stoichiometric compatibility class, or at least, give a refined lower bound 
for this maximum number. Detect multistationarity parameters with 
maximum/many stoichiometrically compatible positive steady states. 


As we remarked in (5.17), the steady state variety V°* is an algebraic variety, 
as well as the translates S,. So the previous questions are indeed questions about 
the number of positive solutions of multivariate sparse systems of real polynomials. 
This is a question of quantifier elimination in real algebraic geometry, which is in 
principle effectively computable for a given system, but with a high complexity (in 
particular for parametric systems). Giving sharp bounds on the number of positive 
zeros for parametric polynomial systems from their structure is a very difficult 
theoretical problem. One obvious upper bound is the Bernstein-Kouchnirenko- 
Khovanskii bound for the number of complex solutions with nonzero coordinates 
(see for instance p. 206 in ), which is usually much bigger than the number 
of positive solutions. The main result in the area is due to Khovanskii [233]. 
He proved that given a sparse system of n polynomials in n variables with real 
coefficients and exponents in a finite set A, there is a bound for the number of 
nondegenerate positive real roots in terms of the cardinality of A. This bound is 
very far from being sharp. There are some sharper particular results (see e.g. [31)), 
but the general question of finding a tight bound is out of reach for the moment. 

We now present some important basic biological mechanisms. 


EXAMPLE 5.45. Phosphorylation Cycles. Multisite protein phosphoryla- 
tion/dephosphorylation plays a prominent role in intracellular processes like signal 
transduction and cell-cycle control. The distributive sequential n-phosphorylation 
cycle has been extensively studied. It describes the phosphorylation of a protein 
with n sites where a phosphate group can be added or released (we denote these 
species by So, 51,...,5n, where S; means that it has 7 sites with a phosphate 
group) by a pair of enzymes (a kinase E and a phosphatase F’) in a sequential 
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and distributive mechanism. This means that there are intermediate species at 
each phospho/dephosphorylation step, where only one phosphate group is added or 
released. The reaction network for the case n = 2 is as follows: 


Kong Katy Kony Keaty 

So E Z ESo S,; +E Z ES, > Sot FE 
Kottg Kote, 
tony leaty long leatg 

So F o> FS» Sy Fo FS, + So F 


lott, lott 


We denote the concentrations of the nine chemical species Sp, ,52,f,F, and 
ESo, ES), FS, F'S\, by «1, %2,23,04,0%5, and y}1, y2, y3, y4, respectively. The asso- 
ciated mass-action kinetics dynamical system describing the evolution in time of 
the concentrations equals: 


dx1 dx4 
TIL =—kon yp %14 Ar Kofty V1 + leat Y4 Ts =—konyt1t4—kon, 204 + (Kotty + Keato yy 
dx2 
TL =—kon, LQL4A Keaty Y1 + Koff, Y2 + (Kost, + Keat, )y2 
dxs 

—lony ©2%5 + leaty Yy3 + lofty Y4 di =—long £245 — lon, 305 + (lost, + leaty )Y¥3 
dx3 _ 
TL =keat, Yo lon, 325 + low, ¥3 + (lott, + leaty )Y4 
d dy: 
= =Kony %124 a (Kofty =A Keato )Y1 = =lon1 L3XL5 — (loft, “lr leat, )Y¥3 
dy2 dya 
a =kon, TQ2L4 — (Korky “ Keat, )y2 7 long £275 _ (lofty IF leato )Y4- 


The space of linear conservation relations has dimension 3. The basis which is 
usually taken gives rise to the following linear invariants of the trajectories: 


%1+%24+234+ y1 + yo + y3 + Ys =Stot 
Lat yr + Yy2 =Ltot 


5 + y3 + Ya =Frot, 


where Stot, Etot, Ftot are positive real numbers for any choice of initial condition in 
the positive orthant. Note that there are 12+ 3 = 15 parameters. For any n € N, 
the system is similar, with 3n +3 species (so 3n +3 variables, corresponding to the 
concentrations of n + 1 substrates So,...,5,, 2 enzymes FE, F' and 2n intermediate 
species), and 6n + 3 parameters (6n reactions and so, 6n reaction rate constants, 
plus 3 independent conservation relations and so, 3 total amounts). Notably, there 
are many general results even with this number of variables and parameters. We 
will outline the methods to get these results below. 

When n > 2, the distributive sequential n-phosphorylation cycle has the ca- 
pacity of multistationarity. This is shown in the nice article by Wang and 
Sontag, where it is proved that there cannot be more than 2n — 1 positive steady 
states, while there are parameters for which the system has n + 1 positive steady 
states in case n is even and n in case n is odd (that is, 2|n/2] +1, where |-| denotes 
integer part). Multistationarity for n > 2 is also explored in [209]. In it is 
shown that the upper bound 2n — 1 is attained for certain parameter values in case 
n = 3,4 (this also holds for n = 2 since 2.-2—1=2+41). It is an open question 
whether this upper bound is attained for n > 5! 
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In the paper [30], parameter regions on all the parameters are given for the 
occurrence of multistationarity for the distributive sequential n-site phosphoryla- 
tion systems (with 3 positive steady states), using a polyhedral approach that we 
briefly describe below. Using these same tools, the paper describes regions 
of multistationarity for any n where the number of stoichiometric compatible pos- 
itive steady states equals 2|n/2| + 1. On the other hand, in the article a 
choice of parameters is found for which there are this number of nondegenerate sto- 
ichiometrically compatible steady states and moreover, |n/2| + 1 are stable, with 
techniques of singular perturbation theory (it had been shown evidence for this un- 
limited mulstistability in [346]). When there are no linear conservation relations, a 
(nondegenerate) steady state x, is stable if all the the eigenvalues of the Jacobian of 
fi,---;fn at x, have negative real part. In case the stoichiometric subspace S 4 0 
(and the linear forms in S+ define all linear relations among f,..., fn, a (nonde- 
generate) steady state is stable if the Jacobian at x, has n — dim(S) eigenvalues 
equal to 0 and the rest have negative real part. In this case, trajectories sufficiently 
close to x, will converge to it. 

In the processive n-site phosphorylation cycle instead, the unphosphorylated 
substrate So binds to the enzyme F and eventually gets n-phosphorylated sites 
(S,,) via monomolecular intermediate complexes consisting of a single intermedi- 


ate species U,,...,U,, and the dephosphorylations reactions are modeled similarly 
(through intermediate species V,,...,V, by the action of the enzyme F’): 
Sp +E SS Uy > Uy “> - SU, 8, +E 
£ j Ln Ln 
S,+F 2 Vy 4 Vi jet a ee. ae 


In this case, there also 3 linearly independent conservation relations, 2n + 4 species 
and (2n + 4) +3 = 2n+7 parameters. It is proved in that the system is 
monostationary for any value of n and moreover, that the only steady state in each 
stoichiometric compatibility class is a global attractor for all trajectories starting 
in the positive orthant. We refer the reader to the survey for futher recent 
progress and directions. << 


EXAMPLE 5.46. Enzymatic Cascades. The simplest case of a cascade with 
the capacity of multistationarity consists of a cascade with two layers and a single 
phosphorylation/dephosphorylation at each layer, with one phosphatase [152]. It 
corresponds to a labeled directed graph, with 9 variables and 18 parameters, where 
each phosphorylation follows the same mechanism as in the processive distribu- 
tive cycle in Example [5.45] with an intermediate species. The ten species are the 
substrates So, 5; in the first layer, the substrates Po, P, in the second layer, four 
intermediate complexes, a kinase F and the same phosphatase F' which dephos- 
phorylates the substrates in both layers. The forward enzyme in the second layer 
is the phosphorylated substrate S; from the first layer. 

The cascade is shown on the next page on the left. On the right, we show a 
typical depiction of the mechanism that hides the reaction rate constants and the 
intermediate species. 

In this case, there are 4 linearly independent conservation relations. In case 
there are two different phosphatases F, F> that interact with S, and P, in the de- 
phosphorylation path, the system has 5 linearly independent conservation relations 
and is monostationary for any choice of parameters. In we described regions 
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of multistationarity in terms of all the parameters for cascades with n-layers (and 
so with an increasing unbounded number of linearly independent conservation rela- 
tions), when there is the same phosphatase F' acting in two of these layers. Again, 
we used for this the polyhedral approach in [80]. 


“cata 


ki. ki 
S+tEoS E'So —> Si +£ 


kote 
E 
lang Lato a 
Si + Ff -—> FS, ——+S)+ F So St 
lee SS ay oe 
F ~« 
ong heat P. —, P. 
Po +S; ==> Si Pp — 3P4+S5) 0 ar 
keto 7 


ten eat 
Pi+F o> FP, —OR+F. 
Weeg 


Another important enzymatic cascade is the extracellular signal-regulated ki- 
nase (ERK) pathway, which leads to activation of the effector molecule ERK, which 
controls downstream responses in the cell. It has a crucial role in the occurrence 
and development of diseases or developmental defects [285]. The protein RAS is 
produced inside the cell after a mechanism that started through the membrane, 
and in turn phosphorylates RAF, which activates MEK and then it allows for the 
double phosphorylation of ERK. 


RAS ppERK 
, a sT™G 
RAF pRAF Z 
nN 


RAFPH 


—, 
MEK ppMEK 
nN Sa 
PH PH 


ERK pERK ppERK 
N_- NY 
PH PH 


The dotted arrows indicate a possible retroactivity in the cascade by the activated 
molecule ppERK. This network admits multistationarity, but it is an open problem 
to determine which is the maximum possible number of stoichiometrically com- 
patible positive steady states. The number of complex solutions to the equations 
fi(x) = 0 and the linear equations defining a stoichiometric compatibility class 
can be computed, but this is certainly a non-sharp upper bound for the number 
of positive solutions. Even if there are only three layers, the number of variables 
and parameters is too big to use in a direct way any theoretical approach, as for 
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instance via the computation of discriminants [165]. It might be possible to com- 
putationally attack this question via an improvement of the RAGIib library d 
<< 


EXAMPLE 5.47. Bifunctional Enzymes and ACR. We now consider an os- 
molarity regulation network in bacteria, extracted from [320], which is an example 
of a so-called two-component system. There are two proteins, the sensor kinase 
EnvZ (denoted by X;) and the response regulator OmpR (denoted by Y,). EnvZ 
autophosphorylates (X3) and catalyzes the swap of the phosphate group to the 
response regulator (the phosphorylated form OmpR-P is denoted by Y2) via an 
intermediate species U;. In this mechanism, EnvZ can also be bound to ATP (X2) 
or to ADP (Xo), and in both cases it acts as an enzyme, dephosphorylating Y> 
back to Y; via intermediate species Uz and U3. The associated directed graph is as 
follows: 


XX, =x, = x, 

Xg+¥, =U, “X42 
X_ + ¥, =U, “4X, +, 
Xo + Y¥2 = Us “4 Xp + Yq 


As usual, we denote the concentrations of the species with small letters. There are 
two linearly independent conservation relations: 


(5.40) wo +2, +a%9+ 23 +4, + Ue + U3 = Xtot, Yr + Y2 + ur + U2 + Uz = Veot- 


The steady state variety V°* can then cut out in R® by 7 linearly independent 
polynomials among /fi,...,f9 and for positive rate constants has dimension two. 
The particular design of this biochemical network keeps for any positive steady 
state the concentration yz constant with value a(«) > 0 described in (6.41) below, 
which depends on the reaction rate constants but is independent of the positive 
initial concentrations (as long as Yict > a(K)). It is then said that the system 
shows Absolute Concentration Robustness (ACR) in the concentration yg of the 
phosphorylated response regulator. We show in that it is not possible to detect 
this only via linear combinations of the polynomials f,,..., f9 with coefficients in 
the field R. But this becomes evident for generic choices of the rate constants 
via the computation of a Grdbner basis of the ideal Jy spanned by fi,..., fg in 
the polynomial ring Q(K1,.-..,K14)[%0, 11, V2, 3, Yi, Y2, U1, U2, U3] With coefficients 
in the field of rational functions of the reaction rate constants. 

We now explain the elimination of intermediate species developed in in 
this case, which allows us to do the computation more easily. Order the variables 
X= (0,1, ©2, ©3, Y1, Y2, U1, U2, ug). Then 


4 = f(x) = Ke rgyi — (K7 + Ks) Ur 
ue = f(x) = Ko @oy2— (Kio + K11) U2 
mis = fo(x) = Ki2 toy2 — (K13 + K14)Us. 


It follows that at steady state, we can compute the values of the concentrations 
ui, U2, U3 from the values of the concentrations x9, 2,73, 41, y2 and the rate con- 
stants. Also, define the following rational functions of «, which are well defined for 
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any choice of positive values of the reaction rate constants: 


_ _ kKeks — RoR — #12414 
y= 7 2 = F , 32> : 
K7 + Kg Kio + Ki K13 + K14 


Then, if we skip the concentrations of the intermediate species and we order the 
/ 


variables x’ = (X9, 21, 22,23, Y1,Y2), let a g(x’) be the mass-action kinetics 
system associated with the reduced network: 


KL K3 K5 
Xo = X, == X 
Ka Ka 


> Xs 


Vy ENG 

SVG eT 

Xo +¥, 4X) 4+ Yi 
Here, x € V°° if and only if 


K6 K9 R12 roy); with g(x’) =). 


2 = (2, ——23yn, T2Y2, 

( K7 + Kg R10 + Ki11 R13 + K14 
Now, g = (g1,---,96) satisfy two linearly independent linear relations g; + go + 
g3 + ga = 0, g5 + g6 = 0, and thus the ideal J, is spanned, for instance, by the 
polynomials: 


gi. = 1X9 + KX 
93 = 31 — (Ka + K5) ro 
94 = K5tQ— m1 %3Y1 


95 = ~TM23Y1 + T2L2Yo + T3LOY2- 
If we compute a Grobner basis of J, with respect to the purely lexicographic order 
where (x2, y2) are the smaller variables (directly or via the FGLM algorithm), we 
find the following polynomial 
h = @2((K1 72 K3 + 73 K2 Kha + 73 Ko K5) Yo — K1 K3 Ks), 


and we deduce that when x2 4 0, the value of y2 at steady state equals the constant 
Ki K3 K5 


(5.41) a(K) = ; 
K1 72 K3 + 73 K2 Ka + 73 K2 K5 

which is well defined since all «;,7; > 0 and depends only on the reaction rate 
constants. Note that when computing a Grobner basis with a term order that only 
depends on the variables x’ and not on the parameters, we are indeed working in the 
polynomial ring with coefficients in Q(«). We invite the reader to check that this 
is in fact a binomial ideal in the variables x’ by checking that the reduced Grobner 
basis consists of binomials [141], but that the ideal generated by gi,...,g¢ in the 
polynomial ring Q[k1,...,45,71,72,73, "| is not binomial and the value of y2 at a 
zero of the ideal is not constant when the variables x’ are nonzero. It follows that 
the ideal generated by fi,..., fo in Q(«)[2] is binomial but the ideal it generates 
in QJk, x] is not. 

This network also has the following property: it has relevant boundary steady 
states, which is not shared by any of the reaction networks we presented in Ex- 
amples [5.45] and 5.46] A boundary steady state in a system with n variables is 
a point « € Vs \ R&, (see G.18)), that is, a steady state with nonnegative co- 
ordinates and at least one coordinate equal to 0. A boundary steady state is 
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said to be relevant if it lies in a stoichiometric compatibility class which inter- 
sects R&y. Indeed, the stoichiometric compatibility class defined by the affine rela- 
tions (6.40) intersects the positive orthant if and only if Yict > a(«). In this case, 
x = (0,0,0, Xtot, 0, Yor, 0,0, 0) is a relevant boundary steady state. The cardinality 
of VSé is 2, but only one of the steady states lies in the open orthant R4%> and so 
the network is monostationary. << 


General Results From Structure. In recent years, several authors developed 
general results for biochemical reaction networks with algebro-geometric tools from 
the structure of the network, in particular enzymatic networks as those presented in 
Examples [5.45] and 5.46] starting with the paper by Thomson and Gunawar- 
dena. Other articles with this perspective are the papers by Gnacadja, 
and the article by Feliu and Wiuf. Together with Pérez Millan, and building 
on this previous work, we identified in a common structure in many popular 
biological networks, called a MESSI structure, that describes Modifications of type 
Enzyme-Substrate or Swap with Intermediates. All the examples we presented are 
instances of MESSI systems. 

The main idea is that the set S of species can be naturally partitioned into 
a (possibly empty) set S©) of intermediate species and several (nonempty) sets 
SO. ...,S(™ of core species. Each subset S() of core species has its associated 
intermediate species Int(a@) and the mass-action kinetics system is (affine) linear in 
the variables in S$‘) U Int(a). These linearities inside the nonlinear system are key 
to proving different properties. Note that the subsets Int(q@) are not disjoint, and 
this interaction is many times responsible for the multistationarity of the system. 
We refer the reader to for the general definition. There could be different 
partitions of the set of species inducing a MESSI structure on a given network; in 
fact, there is a poset of them and one could always consider a minimal one. 

Given a partition defining a MESSI structure on a biochemical reaction net- 
work with underlying directed graph G of reactions, we can associate to it three 
other directed graphs G), G2,Gg in an algorithmic way. First, G, corresponds to 
the elimination of intermediates, modifying the reaction rate constants as in our 
Example based on the results of [153]. In Example a (minimal) par- 
tition is given by S© = {U,,U2,U3}, SY = {Xo, X1, Xo, X3}, S? = {¥1, Vo}, 
and Int(1) = Int(2) = S. The underlying graph of the reduced network in this 
example is the directed graph G,. We also refer the reader to the papers 
for the positive elimination of variables in the steady state equations. 

The directed graph G2 has, in the case of minimal partitions, one connected 
component for each subset S(, a = 1,...,m, and in the component corresponding 
to S(™, all the species not in this subset of the partition are “hidden” in the labels of 
the directed graph. Again in the case of Example[b.47] which besides two enzymatic 
reactions has a swap, the corresponding directed graph G2 equals: 


Ky K3 K T1423 
X) == X, =x, “> x, Y, —¥— y, 
K2 Ka T2%2Q+T3LO 


T1141 


Formally, the mass-action systems associated with G, and G»2 define the same 
steady state equations (and thus, the same steady state ideal). 
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Finally, the directed graph Gg has m nodes corresponding to the subsets of 
core species in the partition and there is an arrow from S‘ to S“) if and only if at 
least one species from the first subset occurs as a label in the component associated 
to the second subset in G2 (after removal of loops, if any). The edges in Gg are 
not labeled. In the case of Example 5.47] the corresponding graph G'g equals 


(5.42) SY = §@), 


Theorem 3.2 in proves that a MESSI system has one (independent) lin- 
ear conservation relation associated to each of the subsets in the partition of the 
species set corresponding to core species (as in (.40)). This implies that all sto- 
ichiometric compatibility classes are bounded and all trajectories are defined for 
any t € Ryo. Moreover, sufficient conditions are given to ensure that there are no 
other independent linear relations. This gives the bases of linear relations in the 
networks in Examples [5.45] and [5.46] Theorem 3.15 in asserts that when G2 
is weakly reversible and Gz has no directed cycles, then there cannot be relevant 
boundary steady states and so any trajectory starting in the positive orthant is at 
a positive distance from the boundary of the orthant (this is called persistence). 
Note that in (6.42), the directed graph Gz does have a cycle and there are rel- 
evant boundary steady states, so this result does not apply to this case, but all 
the enzymatic networks in our first two examples do satisfy the conditions of the 
theorem. Theorem 4.1 in gives sufficient conditions for V°* MN R&, to have a 
rational parametrization, in a very explicit and algorithmic way. This is even more 
explicit for s-toric MESSI systems, in which V°* M R&, can be cut out by bino- 
mials read from the structure of the network (see Definition 4.3 and Theorem 4.8 
in ). Moreover, in this case, it is possible to exactly characterize the capacity 
for multistationarity and give choices of multistationarity parameters. 

Below we state a version of Theorem 5.4 in [290]. This result, in terms of 
the associated oriented matroids of the matrix of exponents and the matrix of 
coefficients of the system, is based on many previous results, starting with those 
in the paper by Craciun and Feinberg, and ending with the comprehensive 
paper [277], including several others [278][291]. We keep the previous notation; in 
particular, « denotes the vector of reaction rate constants of a given network on n 
species. Given a matrix with n rows (resp. n columns) and a subset J C {1,...,n} 
we indicate with the subscript J the submatrix obtained considering only the rows 
(resp. the columns) indicated by the elements of J. We denote o(J) = J jc7J- 


THEOREM 5.48. Assume that the steady state variety V = V°° 1 R&o of a 
chemical reaction network is nonempty and cut out by binomials with coefficients 
in Q(x) that are defined for positive values of the rate constants, and exponents 
arranged as columns of a matrix, from which we choose a submatriz B of maximal 
rank. Equivalently, V 4 0 is parametrized by monomials with exponents in any 
dual matriz A of maximal rank such that the columns of B generate its kernel. Let 
S+ denote a matrix whose rows define the dual of the stoichiometric subspace S. If 
rank($+) = rank(A) = d, then the following statements are equivalent: 


(1) Monostationarity: for any x(0) in R&o and any choice of rate constants kK, there 
is at most a single steady state in VM (a(0) +S). 

(2) For all subsets J C {1,...,n} of cardinality d, the product det(S7) det(A,) 
either is zero or has the same sign as all other nonzero products, and at least 
one such product is nonzero. 
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(3) Same sign conditions with det(S7) det(B,). 
(4) Same sign conditions with (—1)?) det (SF )det(Bri,.. ny\s)- 


The equivalent statements in Theorem [5.48] are also equivalent to the fact that 
there are no vectors w € S and v € rowspan(A) which have coordinatewise the 
same sign (see for instance [279]). When such vectors do exist, they can be used 
to construct multistationarity parameters « (as in [290\[291]). The corresponding 
total amounts are then computed from the steady states. All the results we men- 
tioned are algorithmic and there will be soon a publicly available open source free 
implementation by Guillermo Mosse, based on Algorithm 1 in [290)}. 

There is another equivalent condition to the four equivalent statements in The- 
orem [5.48]in terms of the signs of the coefficients of a polynomial with (0,1) expo- 
nents which can be computed as a determinant. We refer to for details 
about these determinant conditions and to for a use of this approach to detect 
minimal subsets of intermediate species that allow for multistationarity. 


Other Approaches to the Question of Multistationarity. We quickly review 
other approaches to the question of multistationarity. 


Using Brouwer Degree Theory: Another interesting approach to decide the ca- 
pacity for multistationarity without parameter sampling is based on degree theory. 
We refer the reader to Theorem 1 in |83]. In this paper, the authors introduce a 
procedure to partition the reaction rate space into regions for which the system has 
a unique or multiple equilibria. Their procedure is applied to several models of gene 
transcription and cell signalling, for which a complete partitioning of the reaction 
rate space with respect to multistationarity can be obtained. This approach does 
not provide conditions on the total amounts. It has been simplified for so called 
linearly binomial networks (which include many s-toric MESSI systems) in [131]. 


Using Polyhedral Methods: This approach is based on the paper to get 
lower bounds for the number of real positive roots of sparse systems of polynomials, 
by lifting positive real roots from a degeneration to smaller systems that correspond 
to simplices in a regular subdivision of the support of the polynomials. We refer 
the reader to for general results on the use of these ideas to find regions of 
multistationarity in MESSI systems with a rational parametrization, as the enzy- 
matic examples in Examples[5.45]and [5.46] We also refer to Section 3 in for a 
summary and a clarifying example. This approach provides sufficient (but not nec- 
essary) open conditions in the space of all parameters (reaction rate constants and 
total amounts) were multistationarity occurs. Moreover, in principle regions with 
more than 3 stoichiometrically compatible steady states can be detected. Our re- 
sults have been implemented by Rick Rischter in a computer algebra system to find 
open regions in the space of all parameters where the 4-site and the 5-site distribu- 
tive sequential phosphorylation cycles have at least 5 stoichiometrically compatible 
positive steady stated, 


Using Other Methods: As we remarked in the examples, the number of pa- 
rameters is typically too high to try to naively use standard methods in computa- 
tional algebraic geometry. We refer the reader to for numerical methods 
and for symbolic methods to detect sample points in each chamber of the 


?MAPLE and SAGE files with these computations are available at: http://mate.dm.uba.ar/ 
~alidick/DGRPMFiles/. 
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complement of the discriminant (and boundary resultant complexes) in the space 
of parameters. Also, it is shown in that triangular forms can sometimes be 
used to detect multistationarity. Other authors explored the extrapolation of mul- 
tistationarity from subnetworks in various ways (see for instance [15}[84]|/223]) as 
the use of homotopy methods [114]. Researchers from other disciplines usually use 
numerical analysis and intensive parameter sampling. 


Some Open Questions. We have already mentioned two concrete open questions 
on the maximum possible number of stoichiometrically compatible positive steady 
states in the distributive sequential n-site cycle and in the ERK pathway. In the 
first case, there is a conjecture for this maximum possible number (2n — 1), but it 
seems that all degeneration techniques to simplify the resulting system do not have 
more have 2|n/2| +1 such positive steady states (as in [150)[168]), which makes 
the problem complicated. 

More generally, these are open questions in the study of sparse systems of 
polynomial equations with as many polynomials as variables: determine sharp upper 
and lower bounds for the number of positive solutions from the combinatorics of 
the supports, and from the interactions between the coefficients and the exponents 
of the system. This is only known when there is at most one root [277], an upper 
bound in the style of Descartes rule of signs for univariate polynomials is only 
known when the support of the polynomials consists of m+ 2 monomials in m 
variables [29], and in a few instances Khovanskii’s bound has been tightly refined 
as in the case of two trinomials in two variables [296]. 

It is also important to be able to predict from the structure of the network the 
maximum possible number of stoichiometrically compatible positive steady states 
which are stable. Given a reaction network with known parameters and a known 
steady state, stability can be decided without computing the roots of the character- 
istic polynomial of the Jacobian by means of the classical Routh-Hurwitz criterion. 
But this is a hard task with steady states that can be only approximated (which is 
the standard situation) and moreover when dealing with parametric systems. 

Another central question regarding the dynamics of biochemical reaction net- 
works is to predict or preclude the ocurrence of oscillations from the structure. 
There are several recent articles with this approach [14}{82\[85|[2001/284], but this 
is mainly an open field. It is very interesting that even for the distributed sequen- 
tial 2-site phosphorylation cycle it is not known whether it can have oscillatory 
solutions or not! Finally, once the capacity for oscillations has been determined, 
the next question would be to find oscillatory parameters and, better, oscillatory 
open regions in the space of parameters. 


Final Comments. In this section we gave an overview of the state of the art 
and open questions in the algebro-geometric study of biochemical reaction networks 
modeling signaling pathways in systems biology. They give rise to quite diverse, but 
not general mass-action dynamical systems, which allows us to prove mathematical 
results and do computations based on their structure, even for systems with many 
variables and many parameters. 
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